tulsachamber.com
robots.txt

Robots Exclusion Standard data for tulsachamber.com

Resource Scan

Scan Details

Site Domain tulsachamber.com
Base Domain tulsachamber.com
Scan Status Ok
Last Scan2024-06-06T18:33:04+00:00
Next Scan 2024-07-06T18:33:04+00:00

Last Scan

Scanned2024-06-06T18:33:04+00:00
URL https://tulsachamber.com/robots.txt
Domain IPs 104.26.10.57, 104.26.11.57, 172.67.73.246, 2606:4700:20::681a:a39, 2606:4700:20::681a:b39, 2606:4700:20::ac43:49f6
Response IP 104.26.11.57
Found Yes
Hash d5f5b0d5d292322924e291f98cff45af9a040133fe2c90bd74cbf041592de717
SimHash adc092c0f534

Groups

*

Rule Path
Disallow /*print%3Dpdf*

Other Records

Field Value
crawl-delay 5

Comments

  • ROBOTS.TXT
  • tulsachamber.com
  • Google
  • User-agent: Googlebot
  • Disallow: *src=sba
  • Yahoo
  • User-agent: Slurp
  • Disallow:
  • Alta-Vista
  • User-agent: Scooter
  • Disallow:
  • Excite
  • User-agent: ArchitextSpider
  • Disallow:
  • InfoSeek
  • User-agent: UltraSeek
  • Disallow:
  • Lycos
  • User-agent: Lycos_Spider_(T-Rex)
  • Disallow:
  • LookSmart
  • User-agent: MantraAgent
  • Disallow:
  • Alltheweb
  • User-agent: FAST-WebCrawler
  • Disallow: