ne1.www.yahoo.com
robots.txt

Robots Exclusion Standard data for ne1.www.yahoo.com

Resource Scan

Scan Details

Site Domain ne1.www.yahoo.com
Base Domain yahoo.com
Scan Status Ok
Last Scan2024-11-07T21:54:19+00:00
Next Scan 2024-11-21T21:54:19+00:00

Last Scan

Scanned2024-11-07T21:54:19+00:00
URL https://ne1.www.yahoo.com/robots.txt
Domain IPs 2001:4998:44:3507::7000, 74.6.231.19
Response IP 74.6.231.19
Found Yes
Hash 02b5e4bb6514d5bc109c2125db5b4f8428eab0ca76ea08d60ae235596e6c0cb6
SimHash 5905fa408581

Groups

*

Rule Path
Disallow /p/
Disallow /r/
Disallow /bin/
Disallow /caas/
Disallow /blank.html
Disallow /includes/
Disallow /_td_api
Disallow /tdv2_fp
Disallow /nel_ms
Disallow /fp_ms
Disallow /sports_fp_ms
Disallow /search_ms
Disallow /_tdpp_api
Disallow /_remote
Disallow /_multiremote
Disallow /_tdhl_api
Disallow /digest
Disallow /fpjs
Disallow /myjs

admantx

Rule Path
Disallow /

alphabot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

buzzbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claritybot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

huggingface

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

news-please

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

nutch

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

seekr

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.yahoo.com/sitemap/fp-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/gma/sitemaps/gma-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/entertainment/sitemaps/entertainment-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/entertainment/sitemaps/entertainment-sitemap_googlenewsindex_US_en-US.xml.gz
sitemap https://www.yahoo.com/lifestyle/sitemaps/lifestyles-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/lifestyle/sitemaps/lifestyle-sitemap_googlenewsindex_US_en-US.xml.gz
sitemap https://www.yahoo.com/subscriptions/sitemap.xml
sitemap https://www.yahoo.com/news/weather/sitemap.xml
sitemap https://www.yahoo.com/sitemap-uh.xml
sitemap https://www.yahoo.com/news-sitemap-index.xml
sitemap https://www.yahoo.com/sitemap-index.xml
sitemap https://www.yahoo.com/topics/sitemaps/topics-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/games/sitemaps/sitemap_en-us.xml
sitemap https://www.yahoo.com/tech/sitemap-index.xml
sitemap https://www.yahoo.com/tech/news-sitemap-index.xml