inblighty.com
robots.txt

Robots Exclusion Standard data for inblighty.com

Resource Scan

Scan Details

Site Domain inblighty.com
Base Domain inblighty.com
Scan Status Ok
Last Scan 2024-10-29T03:09:40+00:00
Next Scan 2024-11-28T03:09:40+00:00

Last Scan

Scanned 2024-10-29T03:09:40+00:00
URL https://inblighty.com/robots.txt
Domain IPs 50.115.18.80
Response IP 50.115.18.80
Found Yes
Hash 8555f4caf62f1d1c7943c7386c890e59c4522695caa93f08b88631a6e176986e
SimHash 880e2f5e5335

Groups

*

Rule Path
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /cgi-bin
Disallow /test
Disallow /trekbooksold
Disallow /homeold
Disallow /links
Disallow /myinc
Allow /dashboard/travel-tools.php
Disallow /dashboard/
Allow /dashboard/travel-tools.php
Disallow /dashboard/tools
Disallow /BStemplates/
Allow /siteinfo/credits.php
Allow /widgets/travel-widgets.php
Allow /widgets/country-advice-widget2.php
Disallow /widgets
Disallow /siteinfo
Disallow /trekbooks/trekabout.html
Disallow /trekbooks/trekprivacy.html
Disallow /trekbooks/trekcontact.html
Disallow /trekbooks/trekorders.html
Disallow /trekbooks/tnc.html
Disallow /trekbooks/trekcontact2.html
Disallow /trekbooks/trekcontact.php
Disallow /trekbooks/tnc.html
Disallow /amzcom.php
Disallow /amzuk.php
Disallow /go.php
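
The group above mixes Allow and Disallow rules for the same directories (for example /dashboard/ and /widgets), so the outcome for a given URL depends on how a crawler resolves overlapping matches. The sketch below is one way to evaluate a subset of these rules, assuming the longest-match precedence described in RFC 9309 (the most specific rule wins, and Allow wins a tie). The names RULES, pattern_to_regex, and is_allowed are hypothetical helpers for illustration only, and individual crawlers may not behave exactly this way.

```python
import re

# Subset of the "*" group rules listed above, as (directive, path) pairs.
RULES = [
    ("disallow", "/*.js$"),
    ("disallow", "/cgi-bin"),
    ("allow", "/dashboard/travel-tools.php"),
    ("disallow", "/dashboard/"),
    ("allow", "/widgets/travel-widgets.php"),
    ("disallow", "/widgets"),
    ("disallow", "/go.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            is_allow = directive == "allow"
            # The longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and is_allow):
                best_len = length
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for url_path in ("/dashboard/travel-tools.php", "/dashboard/settings.php"):
        print(url_path, is_allowed(url_path))
```

Under this assumption, /dashboard/travel-tools.php stays crawlable because its Allow rule is longer than the Disallow rule for /dashboard/, while other paths under /dashboard/ are blocked. The order in which the rules appear does not affect the result.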

googlebot-image

Rule Path
Disallow /

yandex

Rule Path Comment
Disallow / blocks access to the entire site

linkwalker/2.0

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

Other Records

Field Value
sitemap http://inblighty.com/sitemap.xml

Comments

  • comment