newspaper.mathrubhumi.com
robots.txt

Robots Exclusion Standard data for newspaper.mathrubhumi.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	newspaper.mathrubhumi.com
Base Domain	mathrubhumi.com
Scan Status	Ok
Last Scan	2024-11-08T12:13:06+00:00
Next Scan	2024-11-15T12:13:06+00:00

Last Scan

Scanned	2024-11-08T12:13:06+00:00
URL	https://newspaper.mathrubhumi.com/robots.txt
Redirect	https://newspaper.mathrubhumi.com/?upapi=true&ot=com.atex.plugins.seoplugin.RobotsTxt.ot
Domain IPs	23.36.49.240
Response IP	23.54.58.34
Found	Yes
Hash	6fa74b25454cfafba81332431714bd72789f330b168e4d14e9beb25923bff2f8
SimHash	6e2449c1a5b4

Groups

*

Rule	Path
Allow	/
Disallow	/WEB-INF
Disallow	/META-INF
Disallow	/polopolydevelopment/
Disallow	/status/
Disallow	/search-results/
Disallow	/poll/
Disallow	/logger/
Disallow	/cm/
Disallow	/cmlink/
Disallow	/error/
Disallow	/errorpages/
Disallow	/content/
Disallow	/captcha/
Disallow	/membership/
Disallow	/blogimageupload/
Disallow	/263
Disallow	/json
Disallow	/webpage
Disallow	/tabjson
Disallow	/home
Disallow	/sponsored/
Disallow	/section/

Rule

Path

Allow

Disallow

/WEB-INF

Disallow

/META-INF

Disallow

/polopolydevelopment/

Disallow

/status/

Disallow

/search-results/

Disallow

/poll/

Disallow

/logger/

Disallow

/cm/

Disallow

/cmlink/

Disallow

/error/

Disallow

/errorpages/

Disallow

/content/

Disallow

/captcha/

Disallow

/membership/

Disallow

/blogimageupload/

Disallow

/263

Disallow

/json

Disallow

/webpage

Disallow

/tabjson

Disallow

/home

Disallow

/sponsored/

Disallow

/section/

proximic

Rule	Path
Disallow	/

Rule

Path

Disallow

grapeshot

Rule	Path
Disallow	/

Rule

Path

Disallow

kraken

Rule	Path
Disallow	/

Rule

Path

Disallow

seznambot

Rule	Path
Disallow	/

Rule

Path

Disallow

awariorssbot

Rule	Path
Disallow	/

Rule

Path

Disallow

awariosmartbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

anthropic-ai

Rule	Path
Disallow	/

Rule

Path

Disallow

claude-web

Rule	Path
Disallow	/

Rule

Path

Disallow

facebookbot

Rule	Path
Disallow	/

Rule

Path

Disallow

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

piplbot

Rule	Path
Disallow	/

Rule

Path

Disallow

Other Records

Field	Value
sitemap	https://newspaper.mathrubhumi.com/sitemap.xml

Field

Value

sitemap

https://newspaper.mathrubhumi.com/sitemap.xml

Comments

Sitemaps

newspaper.mathrubhumi.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

proximic

grapeshot

kraken

seznambot

awariorssbot

awariosmartbot

gptbot

ccbot

anthropic-ai

claude-web

facebookbot

google-extended

piplbot

Other Records

Comments

newspaper.mathrubhumi.com
robots.txt