beta.cricket.yahoo.com
robots.txt

Robots Exclusion Standard data for beta.cricket.yahoo.com

Resource Scan

Scan Details

Site Domain beta.cricket.yahoo.com
Base Domain yahoo.com
Scan Status Ok
Last Scan2024-10-17T11:33:20+00:00
Next Scan 2024-11-16T11:33:20+00:00

Last Scan

Scanned2024-10-17T11:33:20+00:00
URL http://beta.cricket.yahoo.com/robots.txt
Redirect https://www.yahoo.com/robots.txt
Redirect Domain www.yahoo.com
Redirect Base yahoo.com
Domain IPs 13.248.158.7, 76.223.84.192
Redirect IPs 106.10.236.37, 106.10.236.40, 180.222.114.11, 180.222.114.12, 2406:2000:98:800::e5, 2406:2000:98:800::e6, 2406:2000:e4:1604::1000, 2406:2000:e4:1604::1001
Response IP 106.10.236.40
Found Yes
Hash 02b5e4bb6514d5bc109c2125db5b4f8428eab0ca76ea08d60ae235596e6c0cb6
SimHash 5905fa408581

Groups

*

Rule Path
Disallow /p/
Disallow /r/
Disallow /bin/
Disallow /caas/
Disallow /blank.html
Disallow /includes/
Disallow /_td_api
Disallow /tdv2_fp
Disallow /nel_ms
Disallow /fp_ms
Disallow /sports_fp_ms
Disallow /search_ms
Disallow /_tdpp_api
Disallow /_remote
Disallow /_multiremote
Disallow /_tdhl_api
Disallow /digest
Disallow /fpjs
Disallow /myjs

admantx

Rule Path
Disallow /

alphabot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

buzzbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claritybot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

huggingface

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

neevabot

Rule Path
Disallow /

news-please

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

nutch

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

perplexity-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

seekr

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

zumbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.yahoo.com/sitemap/fp-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/gma/sitemaps/gma-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/entertainment/sitemaps/entertainment-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/entertainment/sitemaps/entertainment-sitemap_googlenewsindex_US_en-US.xml.gz
sitemap https://www.yahoo.com/lifestyle/sitemaps/lifestyles-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/lifestyle/sitemaps/lifestyle-sitemap_googlenewsindex_US_en-US.xml.gz
sitemap https://www.yahoo.com/subscriptions/sitemap.xml
sitemap https://www.yahoo.com/news/weather/sitemap.xml
sitemap https://www.yahoo.com/sitemap-uh.xml
sitemap https://www.yahoo.com/news-sitemap-index.xml
sitemap https://www.yahoo.com/sitemap-index.xml
sitemap https://www.yahoo.com/topics/sitemaps/topics-sitemap_index_US_en-US.xml.gz
sitemap https://www.yahoo.com/games/sitemaps/sitemap_en-us.xml
sitemap https://www.yahoo.com/tech/sitemap-index.xml
sitemap https://www.yahoo.com/tech/news-sitemap-index.xml