usatoday.com
robots.txt

Robots Exclusion Standard data for usatoday.com

Resource Scan

Scan Details

Site Domain usatoday.com
Base Domain usatoday.com
Scan Status Ok
Last Scan2024-10-29T19:36:53+00:00
Next Scan 2024-11-05T19:36:53+00:00

Last Scan

Scanned2024-10-29T19:36:53+00:00
URL https://usatoday.com/robots.txt
Redirect https://www.usatoday.com/robots.txt
Redirect Domain www.usatoday.com
Redirect Base usatoday.com
Domain IPs 151.101.42.62
Redirect IPs 151.101.130.62, 151.101.194.62, 151.101.2.62, 151.101.66.62
Response IP 199.232.46.62
Found Yes
Hash cd885499987d612dc4a21308bcd1a1ad0fd02c10809b149c937211a3cf425e0e
SimHash 7d9b1bffc5e2

Groups

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /story/sponsor-story/
Disallow /picture-gallery/sponsor-story/
Disallow /videos/sponsor-story/
Disallow /longform/sponsor-story/
Disallow /pages/interactives/sponsor-story/
Disallow /interactives/sponsor-story/
Disallow /videos/embed/

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

news-please

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler

Rule Path
Disallow /

peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

*

Rule Path
Disallow /errors
Disallow /interactive/
Disallow /userauth/
Disallow /ugc/
Disallow /feeds/
Disallow /services/
Disallow /facebook/
Disallow /version-info/
Disallow /longform/draft/
Disallow /story/draft/
Disallow /topic/*/smart/
Disallow /search
Disallow /module-showcase/
Disallow /newsletter/
Disallow /blended-newsletter/
Disallow /story/nletter/
Disallow /sports/services/photos/
Disallow /optimus
Disallow /ux-train
Disallow /story/advisory/
Disallow /.cam-tangent/
Disallow /pbd/
Disallow /gciaf/
Disallow /exp-cruise
Disallow /exp-las-vegas2
Disallow /exp-faw
Disallow /exp-caribbean
Disallow /exp-beach
Disallow /exp-cruise2
Disallow /yourtake
Disallow /story/sports/ncaab/2014/03/20/ge-cfo-challenge-daniel-kelly-amfam/6661213/
Disallow /story/2014/03/20/ge-cfo-challenge-david-bartlett-amway/6653003/
Disallow /story/sports/ncaab/2014/03/20/ge-cfo-challenge-art-mccarthy-neulion/6655521/
Disallow /story/sports/ncaab/2014/03/20/ge-cfo-challenge-david-gross-major-league-lacrosse/6646987/
Disallow /money/lookup/stocks/
Disallow /money/blueprint/archive/
Disallow /money/blueprint/transfer?pname=

Other Records

Field Value
sitemap https://www.usatoday.com/news-sitemap.xml
sitemap https://www.usatoday.com/web-sitemap-index.xml
sitemap https://www.usatoday.com/video-sitemap-index.xml
sitemap https://www.usatoday.com/money/blueprint/sitemap.xml
sitemap https://www.usatoday.com/money/blueprint/sitemap-news.xml
sitemap https://www.usatoday.com/online-betting/sitemap.xml
sitemap https://www.usatoday.com/online-betting/news-sitemap.xml
sitemap https://www.usatoday.com/money/homefront/sitemap.xml
sitemap https://www.usatoday.com/tech/internet/sitemap.xml

Comments

  • robots.txt file for https://www.usatoday.com/