aska.ms
robots.txt

Robots Exclusion Standard data for aska.ms

Resource Scan

Scan Details

Site Domain aska.ms
Base Domain aska.ms
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-10-08T20:06:43+00:00
Next Scan 2026-01-06T20:06:43+00:00

Last Successful Scan

Scanned2023-03-24T19:30:52+00:00
URL http://aska.ms/robots.txt
Domain IPs 37.230.114.37
Response IP 37.230.114.37
Found Yes
Hash c5ac2c56181febb5f1b0c4f809ef54276d5d9d5ff96aa5d18bf7b772d695c583
SimHash 291e6a014ed4

Groups

mediapartners-google*

Rule Path
Disallow

*

Rule Path
Disallow /admin/
Disallow /download/
Disallow /includes/
Disallow /temp/
Disallow /debug/
Disallow /tmp/
Disallow /pub/
Disallow /templates/
Disallow /urchin/
Disallow /admin/
Disallow /download/
Disallow /font/
Disallow /includes/
Disallow /install/
Disallow /tmp/
Disallow /templates/
Disallow /m1_export_rss.php
Disallow /address_book_process.php
Disallow /account.php
Disallow /account_edit.php
Disallow /account_edit_process.php
Disallow /account_history.php
Disallow /account_history_info.php
Disallow /address_book.php
Disallow /checkout_process.php
Disallow /advanced_search.php
Disallow /advanced_search_result.php
Disallow /checkout_address.php
Disallow /checkout_confirmation.php
Disallow /checkout_payment.php
Disallow /checkout_success.php
Disallow /conditions.php
Disallow /contact_us.php
Disallow /create_account.php
Disallow /create_account_process.php
Disallow /create_account_success.php
Disallow /info_shopping_cart.php
Disallow /login.php
Disallow /logoff.php
Disallow /password_forgotten.php
Disallow /popup_image.php
Disallow /popup_search_help.php
Disallow /privacy.php
Disallow /product_notifications.php
Disallow /product_reviews.php
Disallow /product_reviews_info.php
Disallow /product_reviews_write.php
Disallow /redirect.php
Disallow /reviews.php
Disallow /shipping.php
Disallow /shopping_cart.php
Disallow /tell_a_friend.php
Disallow /cookie_usage.php

Other Records

Field Value
sitemap http://www.askabilliards.com/xml_sitemap.php

Comments

  • CRELoaded Generated Robots.txt
  • Robot Exclusion File -- robots.txt
  • Author: CRELoaded Team
  • Last Updated : May 11th 2005
  • enhancements by Ted C
  • User-agent: *
  • *Disallow: crawlers.looksmart.com
  • Block All Images
  • Disallow: *.gif
  • Disallow: *.jpg
  • Disallow: /*.gif$
  • Disallow: /*.jpg$
  • To block main page due to size from bots
  • Disallow: /index.php
  • To shut down site completely from bots use either
  • Disallow: /
  • If you had search engine friendly URLs on and want to clean them out of the bots
  • Disallow: /default.php/cPath/
  • Block out directories bots don't need to go to
  • Block out things that are secure or login oriented
  • IF YOU DO NOT WISH TO HAVE THE GOOGLE IMAGE BOT SCAN YOUR DOMAIN FOR IMAGES
  • THEN YOU CAN INCLUDE THE FOLLOWING IN YOUR ROBOTS FILE.
  • I FOUND THAT MY BANDWIDTH USAGE DROPPED BY A MASSIVE AMOUNT AFTER I GOT RID
  • OF THE GOOGLE IMAGE BOT. ALL I HAD WAS IMAGE HUNTERS STEALING PRODUCT SHOTS
  • AND NOT EVEN BROWSING THE SITE.
  • User-agent: Googlebot-Image
  • Disallow: /