archives.mathrubhumi.com
robots.txt

Robots Exclusion Standard data for archives.mathrubhumi.com

Resource Scan

Scan Details

Site Domain archives.mathrubhumi.com
Base Domain mathrubhumi.com
Scan Status Ok
Last Scan 2024-05-03T07:13:47+00:00
Next Scan 2024-05-10T07:13:47+00:00

Last Scan

Scanned 2024-05-03T07:13:47+00:00
URL https://archives.mathrubhumi.com/robots.txt
Redirect https://archives.mathrubhumi.com/?ot=com.atex.plugins.seoplugin.RobotsTxt.ot
Domain IPs 23.50.95.100
Response IP 104.103.148.114
Found Yes
Hash 337fbb02501f53c0b330960faa536574f4abd773cebc6b0d962b2b22654922fe
SimHash a8244081a3f4

Groups

*

Rule Path
Allow /
Disallow /WEB-INF
Disallow /META-INF
Disallow /polopolydevelopment/
Disallow /status/
Disallow /search-results/
Disallow /poll/
Disallow /logger/
Disallow /cm/
Disallow /cmlink/
Disallow /error/
Disallow /errorpages/
Disallow /content/
Disallow /captcha/
Disallow /membership/
Disallow /blogimageupload/
Disallow /263
Disallow /json
Disallow /webpage
Disallow /tabjson
Disallow /home
Disallow /sponsored/
Disallow /section/
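
The "*" group mixes a blanket Allow / with many narrower Disallow paths, so the outcome for a given URL depends on which rule takes precedence. A minimal sketch of longest-match evaluation (the precedence described in RFC 9309) is shown here, assuming plain prefix matching, which is sufficient because none of the listed paths use wildcards. The names RULES and is_allowed are hypothetical, and only a subset of the rules is included for brevity.

```python
# Subset of the "*" group rules listed above, as (directive, path) pairs.
# RULES and is_allowed are illustrative names, not part of the scan data.
RULES = [
    ("allow", "/"),
    ("disallow", "/WEB-INF"),
    ("disallow", "/status/"),
    ("disallow", "/search-results/"),
    ("disallow", "/home"),
    ("disallow", "/section/"),
]


def is_allowed(path, rules):
    """Longest-match evaluation: the most specific matching rule wins.

    If an Allow and a Disallow rule match with equal length, Allow wins.
    A path matched by no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/", "/news/kerala", "/section/sports", "/homepage", "/status"]:
        print(p, is_allowed(p, RULES))
```

Under this evaluation, /homepage is blocked because Disallow /home has no trailing slash and therefore matches any path beginning with /home, while /status stays allowed because Disallow /status/ only matches paths below that directory.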

googlebot-news

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /image/

twitterbot

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
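
The groups above are alternatives rather than layers: under RFC 9309 a crawler follows the group that names its product token and falls back to "*" only when no group does. The sketch below illustrates that selection, assuming the caller already has the crawler's product token (real crawlers derive it from their full User-Agent string). GROUPS and select_group are hypothetical names, and the "*" entry is abbreviated.

```python
# Groups from the scan, keyed by lowercase product token.
# The "*" entry is abbreviated; the full rule list appears in the scan data.
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/section/")],
    "googlebot-news": [("allow", "/")],
    "googlebot-mobile": [("allow", "/")],
    "googlebot-image": [("allow", "/image/")],
    "twitterbot": [("allow", "/")],
    "ia_archiver": [("allow", "/")],
    "proximic": [("disallow", "/")],
    "grapeshot": [("disallow", "/")],
    "gptbot": [("disallow", "/")],
    "ccbot": [("disallow", "/")],
}


def select_group(product_token, groups):
    """Return the rules for the named crawler, or the "*" rules as a fallback.

    Matching is case-insensitive. The selected group replaces the "*" group;
    the two are not merged.
    """
    return groups.get(product_token.lower(), groups.get("*", []))


if __name__ == "__main__":
    for token in ["GPTBot", "Googlebot-News", "bingbot"]:
        print(token, select_group(token, GROUPS))
```

One consequence of this selection is that googlebot-news, whose group contains only Allow /, is not bound by the Disallow paths in the "*" group, whereas gptbot and ccbot are excluded from the whole site.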