news.yahoo.com
robots.txt

Robots Exclusion Standard data for news.yahoo.com

Resource Scan

Scan Details

Site Domain news.yahoo.com
Base Domain yahoo.com
Scan Status Ok
Last Scan2024-05-23T23:12:34+00:00
Next Scan 2024-06-06T23:12:34+00:00

Last Scan

Scanned2024-05-23T23:12:34+00:00
URL https://news.yahoo.com/robots.txt
Domain IPs 106.10.236.37, 106.10.236.40, 180.222.114.11, 180.222.114.12, 2406:2000:98:800::e5, 2406:2000:98:800::e6, 2406:2000:e4:1604::1000, 2406:2000:e4:1604::1001
Response IP 180.222.114.11
Found Yes
Hash d3f31197692d10f2f049a36b07e09d6d88b631950cb116246ac1da3eb24b20a4
SimHash 6a3cbc340793

Groups

*

Rule Path
Disallow /caas/
Disallow /_td_api
Disallow /tdv2_fp
Disallow /nel_ms
Disallow /fp_ms
Disallow /sports_fp_ms
Disallow /search_ms

Other Records

Field Value
sitemap https://news.yahoo.com/sitemaps/news-sitemap_index_US_en-US.xml.gz
sitemap https://news.yahoo.com/sitemaps/news-sitemap_googlenewsindex_US_en-US.xml.gz