# robots.txt file (make sure the filename is ALL LOWERCASE on Linux/Unix systems)
# This file should go in your web site's ROOT directory.
# The root directory is where your site's main /index.html file would be found.
# It is usually found in /yourhomedir/public_html/ or /yourhomedir/httpdocs
# where "yourhomedir" is your user account's name.

# This says to apply these settings to ALL search engine spiders/crawlers
User-agent: *
Disallow: /cstalk
Disallow: /cooktalk
Disallow: /cgi-bin
#
# These settings will keep well-behaved spiders from crawling your unwanted pages.
# This assumes that your OSC install is in the /catalog directory under your
# web site's ROOT directory
# ie: http://www.yoursite.com/catalog/index.php
Disallow: /catalog/admin
Disallow: /catalog/assets
Disallow: /catalog/download
Disallow: /catalog/images
Disallow: /catalog/includes
Disallow: /catalog/account.php
Disallow: /catalog/account_edit.php
Disallow: /catalog/account_history.php
Disallow: /catalog/account_history_info.php
Disallow: /catalog/account_newsletters.php
Disallow: /catalog/account_notifications.php
Disallow: /catalog/account_password.php
Disallow: /catalog/address_book.php
Disallow: /catalog/address_book_process.php
Disallow: /catalog/advanced_search.php
Disallow: /catalog/advanced_search_result.php
Disallow: /catalog/article_info.php
Disallow: /catalog/article_print.php
Disallow: /catalog/article_reviews.php
Disallow: /catalog/article_reviews_info.php
Disallow: /catalog/checkout_confirmation.php
Disallow: /catalog/checkout_payment.php
Disallow: /catalog/checkout_payment_address.php
Disallow: /catalog/checkout_process.php
Disallow: /catalog/checkout_shipping.php
Disallow: /catalog/checkout_shipping_address.php
Disallow: /catalog/checkout_success.php
Disallow: /catalog/conditions.php
Disallow: /catalog/cookie_usage.php
Disallow: /catalog/create_account.php
Disallow: /catalog/create_account_success.php
Disallow: /catalog/download.php
Disallow: /catalog/google_sitemap.php
Disallow: /catalog/info_shopping_cart.php
Disallow: /catalog/login.php
Disallow: /catalog/password_forgotten.php
Disallow: /catalog/popup_image.php
Disallow: /catalog/popup_search_help.php
Disallow: /catalog/product_print.php
Disallow: /catalog/product_reviews_write.php
Disallow: /catalog/products_new.php
Disallow: /catalog/redirect.php
Disallow: /catalog/shipping.php
Disallow: /catalog/shopping_cart.php
Disallow: /catalog/specials.php
Disallow: /catalog/ssl_check.php
Disallow: /catalog/tell_a_friend.php
#
# Hidden Categories
Disallow: /catalog/new-c-150.html
Disallow: /catalog/newmd-c-152.html
Disallow: /catalog/user-submissions-c-243.html
#
# Feel free to add any other pages on your site that you don't want to be indexed by
# the search engines.
# PLEASE NOTE: Any pages that you list here should be secured by other means if you
# don't want people to be able to view them, as some malicious users will look at a
# robots.txt file to try to find "hidden" or "secret" areas of web sites to find
# confidential information.
# Just uncomment a line or add new ones as you see fit.
Disallow: /private
Disallow: /hidden

# IF YOU DO NOT WISH TO HAVE THE GOOGLE IMAGE BOT SCAN YOUR DOMAIN FOR IMAGES
# THEN YOU CAN INCLUDE THE FOLLOWING IN YOUR ROBOTS FILE.
# I FOUND THAT MY BANDWIDTH USAGE DROPPED BY A MASSIVE AMOUNT AFTER I GOT RID
# OF THE GOOGLE IMAGE BOT. ALL I HAD WAS IMAGE HUNTERS STEALING PRODUCT SHOTS
# AND NOT EVEN BROWSING THE SITE.
User-agent: Googlebot-Image
Disallow: /