statemirror.com
robots.txt

Robots Exclusion Standard data for statemirror.com

Resource Scan

Scan Details

Site Domain statemirror.com
Base Domain statemirror.com
Scan Status Ok
Last Scan2025-09-10T18:04:00+00:00
Next Scan 2025-09-17T18:04:00+00:00

Last Scan

Scanned2025-09-10T18:04:00+00:00
URL https://statemirror.com/robots.txt
Redirect https://www.statemirror.com/robots.txt
Redirect Domain www.statemirror.com
Redirect Base statemirror.com
Domain IPs 13.234.138.67, 3.7.30.147, 65.0.202.73
Redirect IPs 3.165.75.28, 3.165.75.43, 3.165.75.52, 3.165.75.66
Response IP 3.165.75.43
Found Yes
Hash ac6223925399f16162cece5ccda67a6a54593e9a628ca44f359b358e744d75f8
SimHash 925a84138ff3

Groups

*

Rule Path
Allow /
Disallow /admin/*
Disallow /search/*
Disallow /breaking-ticker-articles/*
Disallow /search?*
Disallow /xhr/*
Disallow /preview/story-*
Disallow /amp/preview/story-*
Disallow /staging/*
Disallow /alfoo
Disallow /sildoo
Disallow /dutas
Disallow /metsmall
Disallow /bulletin/*
Disallow /cartoons/*
Disallow /tags/?%3F%3F
Disallow /weekly-items
Disallow /daily-items
Disallow /bulletin
Disallow /the-news-state
Disallow /ashwani
Disallow /reema-roy
Disallow /abhishek
Disallow /ddff
Disallow /ashwani-kumar-mishra-from-uttar-pradesh
Disallow /author/tech-seo-product
Disallow /author/editor-1
Disallow /author/editor-2
Disallow /author/editor-4
Disallow /pdf_upload/1640358657509219522021-406708.pdf
Disallow /xhr/getNewsMixin*
Disallow /h-ajax-request/*
Allow /content/servlet/RDESController?*
Allow /ads.txt

ahrefsbot

Product Comment
ahrefsbot SEO backlink crawler (Ahrefs)
Rule Path
Disallow /

semrushbot

Product Comment
semrushbot SEO crawler (Semrush)
Rule Path
Disallow /

mj12bot

Product Comment
mj12bot SEO crawler (Majestic)
Rule Path
Disallow /

dotbot

Product Comment
dotbot SEO crawler (Moz)
Rule Path
Disallow /

blexbot

Product Comment
blexbot SEO crawler (WebMeUp)
Rule Path
Disallow /

timpibot

Product Comment
timpibot LLM dataset crawler (Timpi)
Rule Path
Disallow /

diffbot

Product Comment
diffbot AI data scraper (Diffbot)
Rule Path
Disallow /

ccbot

Product Comment
ccbot Common Crawl bot (if you choose to block it)
Rule Path
Disallow /

turnitinbot

Product Comment
turnitinbot Plagiarism checker bot
Rule Path
Disallow /

piplbot

Product Comment
piplbot Data broker / people search bot
Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.statemirror.com/sitemap/sitemap-index.xml
sitemap https://www.statemirror.com/news-sitemap-daily.xml
sitemap https://www.statemirror.com/sitemap-daily.xml

Comments

  • robots.txt for https://www.statemirror.com/
  • Disallow bots that are harmful, heavy, or low-value: