greenvillechamber.org
robots.txt

Robots Exclusion Standard data for greenvillechamber.org

Resource Scan

Scan Details

Site Domain greenvillechamber.org
Base Domain greenvillechamber.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-02T19:56:24+00:00
Next Scan 2024-07-31T19:56:24+00:00

Last Successful Scan

Scanned2023-06-15T19:33:34+00:00
URL https://greenvillechamber.org/robots.txt
Redirect http://www.greenvillechamber.org/robots.txt
Redirect Domain www.greenvillechamber.org
Redirect Base greenvillechamber.org
Domain IPs 104.26.8.198, 104.26.9.198, 172.67.69.99, 2606:4700:20::681a:8c6, 2606:4700:20::681a:9c6, 2606:4700:20::ac43:4563
Redirect IPs 104.26.8.198, 104.26.9.198, 172.67.69.99, 2606:4700:20::681a:8c6, 2606:4700:20::681a:9c6, 2606:4700:20::ac43:4563
Response IP 172.67.69.99
Found Yes
Hash e30ebf192984fa75105082a48870756c1e674550d3f25b3f796718efb35c98a2
SimHash a5c89ac0e5b4

Groups

*

Rule Path
Disallow

Other Records

Field Value
crawl-delay 5

Comments

  • ROBOTS.TXT
  • www.greenvillechamber.org
  • Google
  • User-agent: Googlebot
  • Disallow:
  • Yahoo
  • User-agent: Slurp
  • Disallow:
  • Alta-Vista
  • User-agent: Scooter
  • Disallow:
  • Excite
  • User-agent: ArchitextSpider
  • Disallow:
  • InfoSeek
  • User-agent: UltraSeek
  • Disallow:
  • Lycos
  • User-agent: Lycos_Spider_(T-Rex)
  • Disallow:
  • LookSmart
  • User-agent: MantraAgent
  • Disallow:
  • Alltheweb
  • User-agent: FAST-WebCrawler
  • Disallow: