ymcasenc.org
robots.txt

Robots Exclusion Standard data for ymcasenc.org

Resource Scan

Scan Details

Site Domain ymcasenc.org
Base Domain ymcasenc.org
Scan Status Ok
Last Scan 2025-09-07T19:23:28+00:00
Next Scan 2025-10-07T19:23:28+00:00

Last Scan

Scanned 2025-09-07T19:23:28+00:00
URL https://ymcasenc.org/robots.txt
Redirect https://www.ymcasenc.org/robots.txt
Redirect Domain www.ymcasenc.org
Redirect Base ymcasenc.org
Domain IPs 104.26.4.215, 104.26.5.215, 172.67.69.80, 2606:4700:20::681a:4d7, 2606:4700:20::681a:5d7, 2606:4700:20::ac43:4550
Redirect IPs 104.26.4.215, 104.26.5.215, 172.67.69.80, 2606:4700:20::681a:4d7, 2606:4700:20::681a:5d7, 2606:4700:20::ac43:4550
Response IP 104.26.5.215
Found Yes
Hash d3265b84a26a702371b79d6f34f27b67b5480b4819a49f75b98b09202bd67467
SimHash 25c9d241e532

Groups

*

Rule Path
Disallow /events/?continue=
Disallow /*srctype%3Dglance*
Disallow /*direct%3Dy*
Disallow /*direct%3Dy
Disallow /*minical*
Disallow /index.php?*print=pdf*
Disallow /*src%3Dreminder*
Disallow /*print%3Dpdf*
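
The Disallow paths above use the `*` wildcard. As a rough illustration of how a crawler might test a URL against them, here is a minimal Python sketch. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch assumes the common interpretation where `*` matches any run of characters, a trailing `$` anchors the end, and rules match from the start of the path. It compares strings literally and does no percent-encoding normalization, so a rule containing `%3D` only matches a literal `%3D` in the URL here.

```python
import re

# Disallow paths copied from the "*" group in this report.
DISALLOW = [
    "/events/?continue=",
    "/*srctype%3Dglance*",
    "/*direct%3Dy*",
    "/*direct%3Dy",
    "/*minical*",
    "/index.php?*print=pdf*",
    "/*src%3Dreminder*",
    "/*print%3Dpdf*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex (hypothetical helper)."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape every literal piece, then let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    # re.match anchors at the beginning, which mirrors prefix matching.
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW)


if __name__ == "__main__":
    for url in ["/events/?continue=abc", "/calendar/minical/2025", "/about/"]:
        print(url, is_disallowed(url))
```

Because this group has no Allow rules, the sketch skips the longest-match precedence logic that a full parser would need.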

Other Records

Field Value
crawl-delay 5
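
The crawl-delay record is a non-standard extension that some crawlers honor and others ignore. One way a polite client could respect it is sketched below, assuming the value is meant in seconds. The `Throttle` class and its `clock` and `sleep` parameters are hypothetical names introduced for illustration; they are injectable so the logic can be exercised without real waiting.

```python
import time

CRAWL_DELAY_SECONDS = 5  # value of the crawl-delay record in this report


class Throttle:
    """Enforce a minimum gap between successive requests (illustrative sketch)."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock
        self.sleep = sleep
        self.last = None  # time of the previous request, if any

    def wait(self):
        """Sleep until `delay` seconds have passed since the last call; return seconds slept."""
        pause = 0.0
        if self.last is not None:
            elapsed = self.clock() - self.last
            pause = max(0.0, self.delay - elapsed)
            if pause:
                self.sleep(pause)
        self.last = self.clock()
        return pause


if __name__ == "__main__":
    throttle = Throttle(CRAWL_DELAY_SECONDS)
    for path in ["/", "/about/"]:
        throttle.wait()  # the second iteration pauses for roughly five seconds
        print("would fetch", path)
```

Calling `wait()` before each request keeps the gap at or above the configured delay; the first call returns immediately because there is no earlier request to space out from.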

Comments

  • ROBOTS.TXT
  • asoft200303.accrisoft.com
  • Google
  • User-agent: Googlebot
  • Disallow:
  • Yahoo
  • User-agent: Slurp
  • Disallow:
  • Alta-Vista
  • User-agent: Scooter
  • Disallow:
  • Excite
  • User-agent: ArchitextSpider
  • Disallow:
  • InfoSeek
  • User-agent: UltraSeek
  • Disallow:
  • Lycos
  • User-agent: Lycos_Spider_(T-Rex)
  • Disallow:
  • LookSmart
  • User-agent: MantraAgent
  • Disallow:
  • Alltheweb
  • User-agent: FAST-WebCrawler
  • Disallow: