valleyymca.org
robots.txt

Robots Exclusion Standard data for valleyymca.org

Resource Scan

Scan Details

Site Domain valleyymca.org
Base Domain valleyymca.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-08-13T18:09:52+00:00
Next Scan 2024-11-11T18:09:52+00:00
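
The gap between the Last Scan and Next Scan timestamps above can be checked with a short calculation. This is only an arithmetic sketch on the two values shown in this report; whether the scanner always uses a fixed rescan interval is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2024-08-13T18:09:52+00:00")
next_scan = datetime.fromisoformat("2024-11-11T18:09:52+00:00")

# Difference between the scheduled next scan and the last attempt.
interval = next_scan - last_scan
print(interval.days)  # 90
```

For this record the two timestamps are exactly 90 days apart.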

Last Successful Scan

Scanned 2024-03-24T18:08:19+00:00
URL https://valleyymca.org/robots.txt
Domain IPs 151.106.32.15
Response IP 151.106.32.15
Found Yes
Hash 06875f6a379cf33fcf33a24f688b37a215b3e86f95a04fde3e8340c60f266070
SimHash 515509e3e2f5
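
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The report does not name the algorithm, so the following is only a sketch of how a content hash of that shape could be produced, assuming SHA-256 over the raw response body. The function name `content_hash` and the sample body are hypothetical and are not the real file contents.

```python
import hashlib

def content_hash(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()

# Hypothetical sample body, NOT the actual robots.txt served by the site.
sample = b"User-agent: *\nDisallow: /cgi-bin\n"
digest = content_hash(sample)
print(len(digest))  # 64 hex digits, the same length as the Hash field above
```

Comparing digests from two scans is a cheap way to tell whether the file changed byte for byte; a similarity hash such as the SimHash field would be needed to judge how much it changed.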

Groups

wprocketbot

Rule Path
Allow /

*

Rule Path
Disallow /*blackhole
Disallow /?blackhole

*

Rule Path
Disallow /*?ddownload=*

*

Rule Path
Disallow /cgi-bin
Disallow /trackback
Disallow /category/*/*
Disallow */trackback

googlebot

Rule Path
Disallow /*.pdf$

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

ninjabot

Rule Path
Allow /*

adsbot-google

Rule Path
Allow /*

googlebot-mobile

Rule Path
Allow /*
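
Several of the rules in the groups above rely on the `*` wildcard and the `$` end anchor (for example `/*.pdf$` and `/category/*/*`). The sketch below shows one way such patterns can be tested against a URL path, assuming the commonly used semantics: rules match from the start of the path, `*` matches any run of characters, and a trailing `$` pins the match to the end. The helper names and the example paths are hypothetical, and real crawlers may differ in details such as precedence between Allow and Disallow.

```python
import re

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regular expression."""
    # A trailing "$" anchors the match at the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces, then join them with ".*" in place of each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def matches(pattern: str, path: str) -> bool:
    """Return True if the rule pattern matches the path as a prefix match."""
    return rule_to_regex(pattern).match(path) is not None

# Patterns taken from the groups above; the paths are made-up examples.
print(matches("/*.pdf$", "/docs/schedule.pdf"))         # True
print(matches("/*.pdf$", "/docs/schedule.pdf?x=1"))     # False
print(matches("/category/*/*", "/category/news/2024"))  # True
print(matches("/category/*/*", "/category/news"))       # False
```

Python's standard `urllib.robotparser` does not interpret `*` or `$` inside paths, which is why a small translator like this is used here instead.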

Other Records

Field Value
sitemap https://valleyymca.org/sitemap_index.xml

Comments

  • No PDF
  • Google Image
  • Google AdSense
  • Internet Archiver Wayback Machine
  • digg mirror
  • Google
  • Google Ads
  • Google Mobile