help.yahoo.com
robots.txt

Robots Exclusion Standard data for help.yahoo.com

Resource Scan

Scan Details

Site Domain help.yahoo.com
Base Domain yahoo.com
Scan Status Ok
Last Scan 2024-11-04T17:44:19+00:00
Next Scan 2024-11-18T17:44:19+00:00

Last Scan

Scanned 2024-11-04T17:44:19+00:00
URL https://help.yahoo.com/robots.txt
Domain IPs 106.10.236.37, 106.10.236.40, 180.222.114.11, 180.222.114.12, 2406:2000:98:800::e5, 2406:2000:98:800::e6, 2406:2000:e4:1604::1000, 2406:2000:e4:1604::1001
Response IP 106.10.236.40
Found Yes
Hash 3fa137a85af0c97d84a2ad1133ce097e62e1ca2730377e64c57f737c43c93148
SimHash 5d047950c690
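
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a sketch under the assumption that it is SHA-256 over the raw response body. The function name `content_hash` and the sample body are hypothetical, and the sample will not reproduce the digest shown above.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual file served by help.yahoo.com.
sample = b"User-agent: *\nDisallow: /rogers/\n"
digest = content_hash(sample)

# A SHA-256 hex digest is always 64 characters, like the Hash field above.
print(len(digest))
```

Comparing this digest between two scans would show whether the file changed at all, while a similarity hash such as the SimHash field is meant to indicate how much it changed.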

Groups

*

Rule Path
Disallow /rogers/
Disallow /help/rogers/
Disallow /help/us/rogers/
Disallow /l/us/yahoo/amp/
Disallow /l/us/yahoo/apt/
Disallow /l/uk/yahoo/apt/
Disallow /l/de/yahoo/apt/
Disallow /mkb/search.php
Disallow /*?page=topics
Disallow /communities/
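
The rules in the `*` group can be checked against a request path with a small matcher. This is a minimal sketch, assuming the wildcard conventions described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, everything else is a prefix match). The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sketch ignores `Allow` rules and longest-match precedence because this group contains only `Disallow` rules.

```python
import re

# Disallow paths copied from the "*" group above.
DISALLOW = [
    "/rogers/",
    "/help/rogers/",
    "/help/us/rogers/",
    "/l/us/yahoo/amp/",
    "/l/us/yahoo/apt/",
    "/l/uk/yahoo/apt/",
    "/l/de/yahoo/apt/",
    "/mkb/search.php",
    "/*?page=topics",
    "/communities/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal pieces and let each '*' match any character sequence.
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """True if any Disallow rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW)


print(is_disallowed("/rogers/account"))            # True: prefix /rogers/
print(is_disallowed("/kb/index?page=topics&y=1"))  # True: wildcard rule
print(is_disallowed("/kb/account"))                # False: no rule matches
```

Python's standard `urllib.robotparser` treats paths as plain prefixes, which is why the wildcard rule `/*?page=topics` is handled here with a hand-written matcher instead.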

admantx

Rule Path
Disallow /

alphabot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

buzzbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claritybot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

huggingface

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

news-please

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

nutch

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

seekr

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

zumbot

Rule Path
Disallow /
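
Taken together, the groups above mean that a crawler named in its own group is blocked from the whole site, while any other crawler falls back to the `*` group. The sketch below illustrates that selection with Python's standard `urllib.robotparser`, using a shortened robots.txt reconstructed from a few of the groups listed here. The reconstructed text is an assumption about the file's layout, not the file actually served, and `ExampleBot` is a hypothetical user agent.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction of a few groups from the listing above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /rogers/
Disallow: /communities/

User-agent: gptbot
Disallow: /

User-agent: ccbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

base = "https://help.yahoo.com"

# A crawler with its own group is blocked everywhere.
print(parser.can_fetch("GPTBot", base + "/kb/account"))       # False

# A crawler without its own group is governed by the "*" group.
print(parser.can_fetch("ExampleBot", base + "/kb/account"))   # True
print(parser.can_fetch("ExampleBot", base + "/communities/")) # False
```

User-agent matching in `urllib.robotparser` is case-insensitive, which is consistent with the lowercase group names shown in this report.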