Odd behavior by "Everybody" viewers

lshick

Joined: 2007-08-19
Posts: 12
Posted: Sat, 2009-11-07 17:15

As far as I can tell, G2 is working perfectly, but I stumbled across some odd behavior by some (?) users in viewing my Gallery and wondered whether it rang any bells with anyone.

I have an open G2 site, just 1 admin (me) and "Everybody," with "Everybody" having only view permissions. Of the roughly 2000 photos on my site, I discovered accidentally that a couple of the photos were intermittently getting hundred of hits per hour, many, many times my normal traffic rate. I verified (I think) that the photos are the ones I posted (nobody substituted any porn).

The hits don't show up on Google Analytics. That tells me that whoever/whatever is hitting the site has shut down Javascript processing.

Does this sound like anything that anyone has seen?

Thanks.

 
nivekiam
nivekiam's picture

Joined: 2002-12-10
Posts: 16504
Posted: Sat, 2009-11-07 18:27
Quote:
That tells me that whoever/whatever is hitting the site has shut down Javascript processing.

If you're talking about the view count in Gallery, Gallery does not use JS to count views. It also only counts 1 view per visitor for every 24 hours. Now, if these people had cookies disabled and were constantly reloading the page that could cause it. Gallery2 (you don't mention which version) also has protections to prevent search engines from incrementing the view count. I know it works for Google, Ask Jeeves, msnbot, Yandex, StackRambler and ConveraCrawler. There could be another search engine out there not playing nice with a few of your images though.
____________________________________________
Like Gallery? Like the support? Donate now!!! See G2 live here

 
lshick

Joined: 2007-08-19
Posts: 12
Posted: Sat, 2009-11-07 19:24

Nivekiam,

Thanks. Sorry, I did skip over a bit there. My comment on JS was with respect to Google Analytics. I *am* seeing the counts in Gallery's DB, for the affected 2 or 3 pages, hundreds in an hour. So I suspect that someone has a made himself a user-agent style bot that isn't on the list (natch), that for whatever reason is whaling away at those 2 or 3 photos (that I've seen).

As long as it's not some sort of attack that--I don't know--is trying a brute-force method to crack a password or something, I can't say I really mind.

I was just wondering whether anyone had seen something similar.

 
nivekiam
nivekiam's picture

Joined: 2002-12-10
Posts: 16504
Posted: Sat, 2009-11-07 19:32

Take a look at your access logs and search for those specific URLs, it could very well be that someone has linked to your photos on some popular blog and you're getting nailed and paying for the bandwidth.
____________________________________________
Like Gallery? Like the support? Donate now!!! See G2 live here

 
lshick

Joined: 2007-08-19
Posts: 12
Posted: Sat, 2009-11-07 20:49

Ah, verrry interesting!

Here's a typical line:
87.118.102.72 - - [07/Nov/2009:12:41:06 -0800] "GET /photos/main.php?g2_view=core.ShowItem&g2_itemId=646&g2_GALLERYSID=13d6ea22aeed5f0692f2fec37b7893f9 HTTP/1.1" 200 14200 "-" "Mozilla/5.0 (compatible; MJ12bot/v1.2.5; http://www.majestic12.co.uk/bot.php?+)" "www.sv-moira.com"

These come in a few seconds apart, with the same g2_itemId but each line has a different g2_GALLERYSID.

"majestic12.co.uk" has a site that claims they're building a new kind of web crawler/index engine. Maybe so, but how many times do they have do hit one photo to index it? Hundreds?

Not nice.

 
lshick

Joined: 2007-08-19
Posts: 12
Posted: Sat, 2009-11-07 21:23

I'll take that back---well, half of it. Their website has a nice page on how to use robots.txt to slow down or block their 'bot. It's early innings, but it does look for the moment as though they actually obey the directives (and that the 'bot in question was in fact Majestic12, and not some other 'bot masquerading as them).

Still would be nice to know why their 'bot found it necessary to pile-drive on the head of those two photos!