m.washingtontimes.com
robots.txt

Robots Exclusion Standard data for m.washingtontimes.com

Resource Scan

Scan Details

Site Domain m.washingtontimes.com
Base Domain washingtontimes.com
Scan Status Ok
Last Scan2024-09-24T05:01:26+00:00
Next Scan 2024-10-24T05:01:26+00:00

Last Scan

Scanned2024-09-24T05:01:26+00:00
URL https://m.washingtontimes.com/robots.txt
Domain IPs 104.22.58.64, 104.22.59.64, 172.67.8.119
Response IP 104.22.58.64
Found Yes
Hash c8004b5d24949b9048629ed418e1550e115abbe1b478f9505eab816b521182fa
SimHash 2346d46281d0

Groups

*

Rule Path
Disallow /apps/
Disallow /search/
Disallow /wire-*
Disallow /budget/
Disallow /guns/
Disallow /taxes/
Disallow /users/
Disallow /accounts/
Disallow /upi-breaking/
Disallow /weather/
Disallow /account/
Disallow /voting/
Disallow /mailfriend
Disallow /admin
Disallow /comments
Disallow /offensivecontent/
Disallow /accounts
Disallow /ajax

yandexbot

Rule Path
Disallow /apps/
Disallow /search/
Disallow /wire-*
Disallow /budget/
Disallow /guns/
Disallow /taxes/
Disallow /users/
Disallow /accounts/
Disallow /upi-breaking/
Disallow /weather/
Disallow /account/
Disallow /voting/
Disallow /mailfriend
Disallow /admin
Disallow /comments
Disallow /offensivecontent/
Disallow /accounts
Disallow /ajax

spinn3r

Rule Path
Disallow /apps/
Disallow /search/
Disallow /wire-*
Disallow /budget/
Disallow /guns/
Disallow /taxes/
Disallow /users/
Disallow /accounts/
Disallow /upi-breaking/
Disallow /weather/
Disallow /account/
Disallow /voting/
Disallow /mailfriend
Disallow /admin
Disallow /comments
Disallow /offensivecontent/
Disallow /accounts
Disallow /ajax

Other Records

Field Value
crawl-delay 30

mail.ru_bot

Rule Path
Disallow /apps/
Disallow /search/
Disallow /wire-*
Disallow /budget/
Disallow /guns/
Disallow /taxes/
Disallow /users/
Disallow /accounts/
Disallow /upi-breaking/
Disallow /weather/
Disallow /account/
Disallow /voting/
Disallow /mailfriend
Disallow /admin
Disallow /comments
Disallow /offensivecontent/
Disallow /accounts
Disallow /ajax

Other Records

Field Value
crawl-delay 60

baiduspider

Rule Path
Allow /news/
Disallow /apps/

Other Records

Field Value
crawl-delay 30

slurp

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

ahrefsbot

Rule Path
Disallow /

bingbot

Rule Path
Allow /
Disallow /admin
Disallow /newsletters/modal/
Disallow /atom/
Disallow /topics/
Disallow /media/

Other Records

Field Value
crawl-delay 1

msnbot

Rule Path
Allow /news/
Disallow /apps/
Disallow /search/
Disallow /wire-*
Disallow /budget/
Disallow /guns/
Disallow /taxes/
Disallow /users/
Disallow /accounts/
Disallow /upi-breaking/
Disallow /weather/
Disallow /account/
Disallow /voting/
Disallow /mailfriend
Disallow /admin
Disallow /comments
Disallow /offensivecontent/
Disallow /accounts
Disallow /ajax

Other Records

Field Value
crawl-delay 1

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

googleproducer

Rule Path
Allow /atom/

amazonbot

Rule Path
Disallow /

applebot

Rule Path
Allow /
Disallow /atom/
Disallow /topics/
Disallow /media/
Disallow /news/2020/
Disallow /news/2021/
Disallow /news/2019/
Disallow /news/2018/
Disallow /news/2017/
Disallow /news/2016/
Disallow /news/2015/
Disallow /news/2014/
Disallow /news/2013/
Disallow /news/2012/
Disallow /news/2011/
Disallow /news/2010/
Disallow /news/2009/
Disallow /news/2008/
Disallow /news/2007/
Disallow /news/2006/
Disallow /news/2005/
Disallow /news/2004/
Disallow /news/2003/
Disallow /news/2002/
Disallow /news/2001/
Disallow /news/2000/
Disallow /news/1999/
Disallow /news/1998/
Disallow /news/1997/
Disallow /news/1996/
Disallow /news/1995/
Disallow /news/1994/

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.washingtontimes.com/sitemap_index.xml
sitemap https://www.washingtontimes.com/sitemap-stories.xml
sitemap https://www.washingtontimes.com/sitemap-entries.xml

Warnings

  • `host` is not a known field.