washpost.com
robots.txt

Robots Exclusion Standard data for washpost.com

Resource Scan

Scan Details

Site Domain washpost.com
Base Domain washpost.com
Scan Status Ok
Last Scan2024-11-12T00:04:33+00:00
Next Scan 2024-11-19T00:04:33+00:00

Last Scan

Scanned2024-11-12T00:04:33+00:00
URL https://washpost.com/robots.txt
Redirect https://www.washingtonpost.com:443/robots.txt
Redirect Domain www.washingtonpost.com
Redirect Base washingtonpost.com
Domain IPs 3.230.40.48, 54.166.27.209
Redirect IPs 173.222.144.137
Response IP 173.222.144.137
Found Yes
Hash b857db43c7ef554403c23eff53f0964867b93a1b425342cc02c56523675a5c39
SimHash d43849b1a0f8

Groups

*

Rule Path
Disallow /*_print.html
Disallow /*_email.html
Disallow /*_singlePage.html
Disallow /*_allComments.html
Disallow /*_jsn.json
Disallow /*_jsonpStatic.js
Disallow /*_nitf.xml
Disallow /*_newsml.html
Disallow /*_qa.html
Disallow /*_meta.xml
Disallow /*_jsnp.js
Disallow /*_json.json
Disallow /*_search.html
Disallow /*_jsonp.js
Disallow /*_jsnpStatic.js
Disallow /*_rss.xml
Disallow /*_mobile.mobile
Disallow /*_mobile.xml
Disallow /*_allCommentsClassicBlog.html
Disallow /*_seo.html
Disallow /*_nimbusJson.json
Disallow /*_nimbusJsonp.js
Disallow /*_nimbusJsonpStatic.js
Disallow /*_modal.html
Disallow /todays_paper/
Disallow /rw/WashingtonPost/Content/Epaper/
Disallow /ac2/
Disallow /blogs/slow-ride/
Disallow /local/blogsandcolumns/slow-ride-story-tanked
Disallow /local/blogsandcolumns/slow-ride-story-achenblog
Disallow /local/blogsandcolumns/slow-ride-stream-tanked
Disallow /local/blogsandcolumns/slow-ride-front
Disallow /utils/
Disallow /jobs/JS_JobSearchResult
Disallow /jobs/UpdateJobEmployerCounterServlet
Disallow /jobs/JS_Login
Disallow /jobs/EU_UpdateJobEmployerCounter
Disallow /blogs/nationals-journal-beta/
Disallow /blogs/test/
Disallow /posttv-beta/
Disallow /posttv/sponsored-video/
Disallow /posttv/c/trendex/
Disallow /posttv/c/video_search/
Disallow /posttv/posttv/trendex
Disallow /posttv/c/embed/
Disallow /rweb/
Disallow /wp-stat/vrroom/
Disallow /classic-apps/
Disallow /news/test/
Disallow /tablet/
Disallow /news/tablet/
Disallow /sf/test/
Disallow /news/test-liveblog/
Disallow /pb/
Allow /pb/resources/
Allow /pb/gr/
Allow /pb/resource/
Disallow /homepage-video-test
Disallow /testpage-forhomepage
Disallow /knowmore
Disallow /test/
Disallow /sslsingle
Disallow /amphtml/news/test/
Disallow /amphtml/blogs/test/
Disallow /amphtml/classic-apps/
Disallow /amphtml/utils/
Disallow /newsletter/
Disallow /wp-dyn/
Disallow /wp-srv/
Disallow /bandito/
Disallow /Fragment/SysConfig/
Disallow /recipes/search/
Disallow /talk/
Disallow /wp-stat/ad/
Disallow /*?*outputType=comment
Disallow /pwapi-proxy/pwproxy/*
Disallow /pwapiv2/
Disallow /*?*outputType=accessibility
Disallow /wp-adv/
Disallow /newssearch/
Disallow /wp-admin/
Disallow /gdpr-consent/
Disallow /*?*outputType=tracking
Disallow /tetro/
Disallow /comments/
Disallow /comments
Disallow /search
Disallow /s/*
Disallow /embed/
Disallow /native/
Disallow /subscribe/braintree/
Disallow /subscribe/enterpriseportal/
Disallow /subscribe/foryouapi/
Disallow /subscribe/lagoon/
Disallow /subscribe/offers/service/
Disallow /subscribe/onsiteapi/
Disallow /subscribe/paywall/
Disallow /subscribe/person/
Disallow /subscribe/preferenceapi/
Disallow /subscribe/subscriptionapi/
Disallow /subscribe/user/
Disallow /subscribe/signin/
Disallow /subscribe/signup/
Disallow /wpost/proxy
Disallow /ehf/
Disallow /ehf/*
Disallow /subscribe/logging/*
Disallow /blogs/*
Disallow /gog/*
Disallow /arcio/fact-checker/

twitterbot

Rule Path
Allow /posttv-beta/
Disallow /amphtml/*

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /licensing-syndication
Disallow /licensing-syndication/*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.washingtonpost.com/sitemaps/sitemap.xml.gz
sitemap https://www.washingtonpost.com/sitemaps/news-sitemap.xml.gz
sitemap https://www.washingtonpost.com/sitemaps/author-sitemap.xml.gz
sitemap https://www.washingtonpost.com/sitemaps/section-sitemap.xml.gz
sitemap https://www.washingtonpost.com/elections/results/sitemap.xml
sitemap https://www.washingtonpost.com/arcio/sitemap/video/index/