Ill-behaved bots are nothing new for Yahoo (I already block some of them via robots.txt and .htaccess), but I'd like to at least control the activity of .crawl.yahoo.net, which is a very useful crawler.
Unfortunately, many of the pages listed below (from meta_tags.php) are still being indexed:
define('ROBOTS_PAGES_TO_SKIP','login,logoff,create_account,account,
account_edit,account_history,account_history_info,account_newsletters,
account_notifications,account_password,address_book,advanced_search,
advanced_search_result,checkout_success,checkout_process,
checkout_shipping,checkout_payment,checkout_confirmation,conditions,
cookie_usage,create_account_success,contact_us,download,
download_timeout,customers_authorization,down_for_maintenance,
password_forgotten,time_out,unsubscribe,info_shopping_cart,
popup_image,popup_image_additional,product_reviews_write,page_2,
page_3,page_4,privacy,shippinginfo,ssl_check,tell_a_friend');
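For context, here is a minimal sketch of how a comma-separated skip list like this can be turned into a robots meta tag. It only illustrates the general technique and is not taken from the cart's actual template code. The constant `EXAMPLE_PAGES_TO_SKIP` and the function `example_robots_meta_tag()` are hypothetical names made up for this example.

```php
<?php
// Hypothetical, shortened list for illustration only; the real list is the
// ROBOTS_PAGES_TO_SKIP define shown above.
define('EXAMPLE_PAGES_TO_SKIP', 'login,logoff,privacy,
conditions,shippinginfo');

/**
 * Return a robots meta tag if $currentPage is in the skip list, else ''.
 * This is an assumed approach, not the store software's own function.
 */
function example_robots_meta_tag($currentPage, $skipList)
{
    // Trim each entry so stray whitespace or line breaks in the list
    // do not prevent an exact match.
    $pages = array_map('trim', explode(',', $skipList));

    if (in_array($currentPage, $pages, true)) {
        return '<meta name="robots" content="noindex, nofollow" />';
    }
    return '';
}

// Usage: print the tag (or an empty line) for two sample page names.
echo example_robots_meta_tag('privacy', EXAMPLE_PAGES_TO_SKIP), "\n";
echo example_robots_meta_tag('index', EXAMPLE_PAGES_TO_SKIP), "\n";
```

Two things are worth keeping in mind with this approach. A noindex meta tag only works if the crawler is allowed to fetch the page, so a page that is also disallowed in robots.txt never gets its tag read. Pages that are already indexed generally stay listed until the crawler revisits them and sees the tag.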
I'm not fond of having my Conditions, Privacy, Shipping Info, FAQs, Ordering and other such pages come up whenever someone searches for my store... Any tips or suggestions would be appreciated!