theanswerhawaii.com
robots.txt

Robots Exclusion Standard data for theanswerhawaii.com

Resource Scan

Scan Details

Site Domain theanswerhawaii.com
Base Domain theanswerhawaii.com
Scan Status Ok
Last Scan2024-10-30T06:45:54+00:00
Next Scan 2024-11-06T06:45:54+00:00

Last Scan

Scanned2024-10-30T06:45:54+00:00
URL https://theanswerhawaii.com/robots.txt
Domain IPs 104.26.6.86, 104.26.7.86, 172.67.74.7, 2606:4700:20::681a:656, 2606:4700:20::681a:756, 2606:4700:20::ac43:4a07
Response IP 172.67.74.7
Found Yes
Hash 9eb6da8f847fba5be5b2921fb282a62fcb9e475b2273c305310b02b30b08180b
SimHash 303cd090c0b5

Groups

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

twitterbot

Rule Path
Disallow

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

architextspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

slurp.so/1.0

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

slurp/2.0j

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

slurp/2.0-kite-hourly

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

slurp/2.0-owlweekly

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

slurp/3.0-au

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

teoma

Product Comment
teoma ASK.com

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

scooter

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

lycos_spider_(t-rex)

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

infoseek sidewinder/9.0

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

*

Rule Path Comment
Disallow /newsletter/ disallow this directory

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Comments

  • User-agent: *
  • Disallow: /
  • User-Agent: SemrushBot
  • Disallow: /
  • Disallow: /
  • AI Bots
  • OpenAI bots
  • Google AI bots - Bard, Gemini and VertexAI
  • commoncrawl AI
  • Perplexity AI
  • End AI Bots