dailysamachara.com
robots.txt

Robots Exclusion Standard data for dailysamachara.com

Resource Scan

Scan Details

Site Domain dailysamachara.com
Base Domain dailysamachara.com
Scan Status Ok
Last Scan2026-02-01T15:17:21+00:00
Next Scan 2026-02-08T15:17:21+00:00

Last Scan

Scanned2026-02-01T15:17:21+00:00
URL https://dailysamachara.com/robots.txt
Redirect https://www.dailysamachara.com/robots.txt
Redirect Domain www.dailysamachara.com
Redirect Base dailysamachara.com
Domain IPs 104.21.11.157, 172.67.166.99, 2606:4700:3035::6815:b9d, 2606:4700:3037::ac43:a663
Redirect IPs 104.21.11.157, 172.67.166.99, 2606:4700:3035::6815:b9d, 2606:4700:3037::ac43:a663
Response IP 104.21.11.157
Found Yes
Hash 65e97434ff888e3045798d8f426f83d79c76bb110805729d54646be6dc65d99b
SimHash 711c4942c0a2

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claudebot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

claude-web

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

google-extended

Rule Path
Allow /

ccbot

Rule Path
Allow /

bytespider

Rule Path
Allow /

facebookbot

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

amazonbot

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

duckassistbot

Rule Path
Allow /
Disallow *?utm_
Disallow *?replytocom
Disallow *?amp

Other Records

Field Value
sitemap https://www.hindiscan.com/sitemap_index.xml