Keyword Albums & Google Web Crawl

sambobb

Joined: 2006-07-22
Posts: 38
Posted: Tue, 2008-08-19 06:33


Gallery version = 2.2.4 core 1.2.0.6
PHP version = 5.2.5 cgi
Webserver = Apache/2.2.8 (Unix) mod_ssl/2.2.8 OpenSSL/0.9.8b mod_auth_passthrough/2.1 mod_bwlimited/1.4
Database = mysqli 5.0.45-community, lock.system=flock
Toolkits = Gd, Exif
Acceleration = none, none
Operating system = Linux omega.ausweb.com.au 2.6.18-53.1.13.el5 #1 SMP Tue Feb 12 13:02:30 EST 2008 x86_64
Default theme = matrix
gettext = enabled
Locale = en_GB
Browser = Mozilla/5.0 (Windows; U; Windows NT 5.1; en-GB; rv:1.8.1.14) Gecko/20080404 Firefox/2.0.0.14
Rows in GalleryAccessMap table = 65
Rows in GalleryAccessSubscriberMap table = 1126
Rows in GalleryUser table = 2
Rows in GalleryItem table = 1126
Rows in GalleryAlbumItem table = 42
Rows in GalleryCacheMap table = 0

I am getting some strange results from Google Webmaster Tools: the web crawl report lists a large number of unreachable URLs which, on further investigation, appear to be keyword albums (there are over 5,000 in total), e.g.:
http://www.darlingriverrun.com.au/stock/main.php?g2_controller=exif.SwitchDetailMode&g2_mode=detailed&g2_return=%2Fstock%2Fmain.php%3Fg2_view%3Dkeyalbum.KeywordAlbum%26g2_keyword%3DArid%26g2_itemId%3D980&g2_returnName=keyword+album

I removed the keyword album plugin months ago (and optimised the database), but the problem is still being reported (the last web crawl was 2 days ago).

Is this just an old cache, or is it a real problem? If it is, what steps do I need to take to fix it?

The site is: http://www.darlingriverrun.com.au/stock/main.php

Thanks in advance for any help.

 
france-japon

Joined: 2009-04-21
Posts: 1
Posted: Tue, 2009-04-21 02:33

Hello,

I have the same problem. Did you find a solution?

http://france-japon.net/albumphotos/main.php/main.php?g2_controller=exif.SwitchDetailMode&g2_mode=detailed&g2_return=%2Falbumphotos%2Fv%2Fart-artistes-japon%2Fshiganaoya%2FPC290028.JPG.html

404 (Not Found), 1 page, 16/04/09

Also, this kind of link doesn't work and is therefore reported by Google Webmaster Tools:
http://france-japon.net/albumphotos/v/Sakura/IMG_0177.JPG/slideshowapplet.html/srss/2068

 
sambobb

Joined: 2006-07-22
Posts: 38
Posted: Tue, 2009-04-21 07:24

Hi france-japon...

No, unfortunately... no reply... no answer...

 
jens_k

Joined: 2007-01-28
Posts: 244
Posted: Tue, 2009-04-21 18:28

Hi sambobb, hi france-japon.

As far as I can see, you are both using a robots.txt file.
I would add lines such as:
Disallow: /stock/main.php?g2_controller=exif.SwitchDetailMode&g2_mode*
Disallow: /albumphotos/main.php/main.php?g2_controller=exif.SwitchDetailMode*

Google accepts wildcards in robots.txt. I would suggest analysing the results in Webmaster Tools to identify other such strings to exclude.
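
A rough way to check that wildcard rules like the two above actually cover the reported URLs is to emulate the matching locally. The Python sketch below is only an approximation under stated assumptions: `*` matches any run of characters, a trailing `$` anchors the end of the URL, and each rule is matched from the start of the path plus query string. It ignores `Allow` lines and rule precedence, and the helper names `rule_to_regex` and `is_disallowed` are made up for this illustration. How the crawler really behaves is the final word.

```python
import re

def rule_to_regex(rule):
    """Turn one Disallow value into a compiled regex (assumed semantics:
    '*' = any run of characters, trailing '$' = end of URL)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' in place of each '*'.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path_and_query, rules):
    """True if any rule matches from the start of the path + query string."""
    return any(rule_to_regex(rule).match(path_and_query) for rule in rules)

# The two rules suggested in this thread.
RULES = [
    "/stock/main.php?g2_controller=exif.SwitchDetailMode&g2_mode*",
    "/albumphotos/main.php/main.php?g2_controller=exif.SwitchDetailMode*",
]

# Path + query of URLs quoted earlier in the thread.
REPORTED = (
    "/stock/main.php?g2_controller=exif.SwitchDetailMode&g2_mode=detailed"
    "&g2_return=%2Fstock%2Fmain.php%3Fg2_view%3Dkeyalbum.KeywordAlbum"
    "%26g2_keyword%3DArid%26g2_itemId%3D980&g2_returnName=keyword+album"
)
SLIDESHOW = "/albumphotos/v/Sakura/IMG_0177.JPG/slideshowapplet.html/srss/2068"

for candidate in (REPORTED, "/stock/main.php", SLIDESHOW):
    print(is_disallowed(candidate, RULES), candidate[:60])
```

Under these assumptions the first reported URL is blocked, while /stock/main.php itself stays crawlable. The slideshowapplet link from france-japon's post is not covered by either rule, so that kind of URL would need a rule of its own.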

On my site this also helped me avoid duplicate content in Google.
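
To find which strings are worth excluding, the unreachable URLs from the crawl report can be grouped by path and Gallery controller or view. The Python sketch below assumes the URLs are already available as a plain list of strings (how they get out of Webmaster Tools is not covered here); `crawl_error_signature` and `summarise` are hypothetical helper names, and the sample list just reuses the URLs quoted in this thread.

```python
from collections import Counter
from urllib.parse import urlsplit, parse_qs

def crawl_error_signature(url):
    """Reduce a URL to its path plus the top-level g2_controller or g2_view
    parameter, if present. Parameters nested inside g2_return are ignored."""
    parts = urlsplit(url)
    query = parse_qs(parts.query)
    for key in ("g2_controller", "g2_view"):
        if key in query:
            return f"{parts.path}?{key}={query[key][0]}"
    return parts.path

def summarise(urls):
    """Count how many URLs share each signature, most frequent first."""
    return Counter(crawl_error_signature(u) for u in urls).most_common()

SAMPLE_URLS = [
    "http://www.darlingriverrun.com.au/stock/main.php?g2_controller=exif.SwitchDetailMode"
    "&g2_mode=detailed&g2_return=%2Fstock%2Fmain.php%3Fg2_view%3Dkeyalbum.KeywordAlbum"
    "%26g2_keyword%3DArid%26g2_itemId%3D980&g2_returnName=keyword+album",
    "http://france-japon.net/albumphotos/main.php/main.php?g2_controller=exif.SwitchDetailMode"
    "&g2_mode=detailed&g2_return=%2Falbumphotos%2Fv%2Fart-artistes-japon%2Fshiganaoya%2FPC290028.JPG.html",
    "http://france-japon.net/albumphotos/v/Sakura/IMG_0177.JPG/slideshowapplet.html/srss/2068",
]

for signature, count in summarise(SAMPLE_URLS):
    print(count, signature)
```

A signature that shows up thousands of times, such as the exif.SwitchDetailMode controller here, is a good candidate for a single wildcard Disallow line.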

Cheers,
Jens

___________________________________
http://jekophoto.eu | http://jekophoto.de