newspaper.mathrubhumi.com
robots.txt

Robots Exclusion Standard data for newspaper.mathrubhumi.com

Resource Scan

Scan Details

Site Domain newspaper.mathrubhumi.com
Base Domain mathrubhumi.com
Scan Status Ok
Last Scan2024-11-08T12:13:06+00:00
Next Scan 2024-11-15T12:13:06+00:00

Last Scan

Scanned2024-11-08T12:13:06+00:00
URL https://newspaper.mathrubhumi.com/robots.txt
Redirect https://newspaper.mathrubhumi.com/?upapi=true&ot=com.atex.plugins.seoplugin.RobotsTxt.ot
Domain IPs 23.36.49.240
Response IP 23.54.58.34
Found Yes
Hash 6fa74b25454cfafba81332431714bd72789f330b168e4d14e9beb25923bff2f8
SimHash 6e2449c1a5b4

Groups

*

Rule Path
Allow /
Disallow /WEB-INF
Disallow /META-INF
Disallow /polopolydevelopment/
Disallow /status/
Disallow /search-results/
Disallow /poll/
Disallow /logger/
Disallow /cm/
Disallow /cmlink/
Disallow /error/
Disallow /errorpages/
Disallow /content/
Disallow /captcha/
Disallow /membership/
Disallow /blogimageupload/
Disallow /263
Disallow /json
Disallow /webpage
Disallow /tabjson
Disallow /home
Disallow /sponsored/
Disallow /section/

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

kraken

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://newspaper.mathrubhumi.com/sitemap.xml

Comments

  • Sitemaps