News:

The anti-spam plugins have stopped being effective. Registration is back to requiring approval. After registering, you must ALSO email me with your username, so that I can manually approve your account.

Main Menu

Mysterious 404s

Started by Miluette, April 16, 2009, 06:59:23 AM

Previous topic - Next topic

0 Members and 1 Guest are viewing this topic.

Miluette

Recently I changed the entire archives for both my webcomics. There should be absolutely no way to access the old files anymore, as they no longer exist. Yet, according to AWStats (which is very good at helping me weed out broken links), lots of people are still somehow clicking to old pages of mine.

I did realize part of it was due to forum posts on my own forum, which I changed, but it still happens with a variety of older pages that I'm sure I didn't link anywhere anytime recently (or ever). I thought maybe people have bookmarked them too, but then that still shouldn't happen so much because I have much doubt people bookmarked random old pages of mine!

Some other 404s seem strange to me too, lol. (And the AWStats Wii icon is broken apparently. |D) Anyone else have strange 404s?
And wasn't it you who told me,
"The sun would always chase the day"?

tapewolf

I have had a few people accessing the HTML documents on mine long after I switched to PHP.  These are all from unidentified external sources so I'm not really sure what to do about that, aside from setting up loads and loads of redirects.  They do finally seem to be tailing off, though, so I'm not as bothered.

The most weird 404s I have been getting were these three:

/images/trans.gif
/java/prototype.js
/java/scriptaculous.js

...I really don't know what they are.  Nothing to do with me at all as far as I can tell.  Eventually, because these things were being continually hammered hundreds of times a month, I created a 1-pixel GIF file and some empty .js files to prevent it drowning out the noise.


"The main difficulty is getting [Qa'Dar] out of his cage.
Far and away the most reliable method I have found is mass-murder." -- The IT-HE guide to Morrowind

Databits

#2
Keep in mind that things like Google (and other search engines) tend to catalog every single bit of your site that it's allowed to (restricted by /robots.txt). So the 404 activity access may very well be search engines attempting to update their records for the things you've removed. That or you managed to kill off some hot linked images. :P

For those who don't know about robots.txt, this is a good start:
http://www.robotstxt.org/

It's simply a text file, there's not really a whole lot to it. But it's useful if you don't want some things being indexed by search engines.
(\_/)    ~Relakuyae D'Selemae
(o.O)    
(")_(")  [Libre Office] [Chrome]

Miluette

I was gonna look into the robots.txt thing eventually.

The things still being accessed are still in areas I wouldn't mind being indexed. On that note, I think I know what's causing it, or part of. *runs to Google webmasters~*
And wasn't it you who told me,
"The sun would always chase the day"?