ambalayellowpages.com
robots.txt

Robots Exclusion Standard data for ambalayellowpages.com

Resource Scan

Scan Details

Site Domain ambalayellowpages.com
Base Domain ambalayellowpages.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-29T09:06:28+00:00
Next Scan 2025-11-28T09:06:28+00:00

Last Successful Scan

Scanned 2025-09-29T18:47:47+00:00
URL https://ambalayellowpages.com/robots.txt
Redirect https://www.ambalayellowpages.com/robots.txt
Redirect Domain www.ambalayellowpages.com
Redirect Base ambalayellowpages.com
Domain IPs 43.230.202.147
Redirect IPs 43.230.202.147
Response IP 43.230.202.147
Found Yes
Hash 00e9cd877a8ab9a919f7675675c2f387500c9eeecdf98816b5c6d70f0d2162de
SimHash 25d058c25551

Groups

msnbot

Rule Path
Disallow

altavista

Rule Path
Disallow

alltheweb

Rule Path
Disallow

aol

Rule Path
Disallow

googlebot

Rule Path
Disallow

inktomi slurp

Rule Path
Disallow

alexa (ia archiver)

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

wisenutbot

Rule Path
Disallow

surveybot

Rule Path
Disallow

teoma

Rule Path
Disallow

gigabot

Rule Path
Disallow

scrubby

Rule Path
Disallow

robozilla

Rule Path
Disallow

nutch

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

yahoo-mmcrawler

Rule Path
Disallow

psbot

Rule Path
Disallow

asterias

Rule Path
Disallow

yahoo-blogs/v3.9

Rule Path
Disallow

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow
Disallow /cgi-bin/
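
Reading the groups above: a Disallow rule with an empty path restricts nothing, so each named crawler is allowed everywhere, while any crawler that falls through to the * group is kept out of /cgi-bin/. The sketch below illustrates that reading in Python. It is a simplified model, not the scanner's own logic: the GROUPS table is a hand-written reconstruction of three of the listed groups, the helper names rules_for and is_allowed are hypothetical, and matching is done on plain path prefixes with the longest matching prefix winning (no wildcard support, no tie-breaking between equal-length rules).

```python
# Hypothetical reconstruction of three of the groups listed above.
# Each rule is a (directive, path) pair; "" stands for an empty Disallow.
GROUPS = {
    "googlebot": [("disallow", "")],
    "yahoo-blogs/v3.9": [("disallow", "")],
    "*": [("disallow", ""), ("disallow", "/cgi-bin/")],
}


def rules_for(user_agent):
    """Pick the group whose name matches the agent (case-insensitive), else '*'."""
    return GROUPS.get(user_agent.lower(), GROUPS["*"])


def is_allowed(user_agent, path):
    """Longest-match evaluation over plain path prefixes (no wildcards)."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for directive, prefix in rules_for(user_agent):
        if not prefix:
            continue  # an empty Disallow restricts nothing
        if path.startswith(prefix) and len(prefix) > best_len:
            best_len, allowed = len(prefix), directive == "allow"
    return allowed


print(is_allowed("googlebot", "/cgi-bin/search"))   # True
print(is_allowed("examplebot", "/cgi-bin/search"))  # False
print(is_allowed("examplebot", "/listings"))        # True
```

Here "examplebot" is a made-up agent name used only to exercise the * fallback. The crawl-delay record is not modelled, since it does not affect which paths may be fetched.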

Comments

  • robots.txt