tv.mathrubhumi.com
robots.txt

Robots Exclusion Standard data for tv.mathrubhumi.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	tv.mathrubhumi.com
Base Domain	mathrubhumi.com
Scan Status	Ok
Last Scan	2024-05-04T00:23:31+00:00
Next Scan	2024-05-11T00:23:31+00:00

Last Scan

Scanned	2024-05-04T00:23:31+00:00
URL	https://tv.mathrubhumi.com/robots.txt
Domain IPs	23.50.95.100
Response IP	104.103.148.114
Found	Yes
Hash	1313129735eaf25f34b586a8250a61c87b44e7a26cfcb1cd0bdbd3ebd802e46f
SimHash	ad344c13c171

Groups

*

Rule	Path
Allow	/
Disallow	/WEB-INF
Disallow	/META-INF
Disallow	/polopolydevelopment/
Disallow	/status/
Disallow	/search-results/
Disallow	/poll/
Disallow	/logger/
Disallow	/cm/
Disallow	/error/
Disallow	/errorpages/
Disallow	/content/
Disallow	/captcha/
Disallow	/membership/
Disallow	/blogimageupload/
Disallow	/search*
Disallow	/json
Disallow	/app

Rule

Path

Allow

Disallow

/WEB-INF

Disallow

/META-INF

Disallow

/polopolydevelopment/

Disallow

/status/

Disallow

/search-results/

Disallow

/poll/

Disallow

/logger/

Disallow

/cm/

Disallow

/error/

Disallow

/errorpages/

Disallow

/content/

Disallow

/captcha/

Disallow

/membership/

Disallow

/blogimageupload/

Disallow

/search*

Disallow

/json

Disallow

/app

googlebot-news

Rule	Path
Allow	/

Rule

Path

Allow

googlebot-mobile
googlebot-image

Rule	Path
Allow	/polopoly_fs/

Rule

Path

Allow

/polopoly_fs/

twitterbot

Rule	Path
Allow	/

Rule

Path

Allow

ia_archiver

Rule	Path
Allow	/

Rule

Path

Allow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

tv.mathrubhumi.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot-news

googlebot-mobilegooglebot-image

twitterbot

ia_archiver

gptbot

ccbot

tv.mathrubhumi.com
robots.txt

googlebot-mobile
googlebot-image