andhrajyothi.com
robots.txt

Robots Exclusion Standard data for andhrajyothi.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	andhrajyothi.com
Base Domain	andhrajyothi.com
Scan Status	Failed
Failure Reason	Scan timed out.
Last Scan	2024-05-29T21:30:56+00:00
Next Scan	2024-06-05T21:30:56+00:00

Last Successful Scan

Scanned	2021-10-17T11:26:21+00:00
URL	http://andhrajyothi.com/robots.txt
Redirect	https://www.andhrajyothy.com/robots.txt
Redirect Domain	www.andhrajyothy.com
Redirect Base	andhrajyothy.com
Found	Yes
Hash	7d2d5b6b634fa75dedd95f92c59523e6aaf4e2b93806bb8391a56c4da6251464
SimHash	b8109d014774

Groups

*

Rule	Path
Allow	/

Rule

Path

Allow

/

mediapartners-google

Rule	Path
Allow	/

Rule

Path

Allow

/

googlebot

Rule	Path
Allow	/feeds/

Rule

Path

Allow

/feeds/

google

Rule	Path
Allow	/feeds/

Rule

Path

Allow

/feeds/

*

Rule	Path
Allow	/news/*
Allow	/photo/*
Allow	/amp/*

Rule

Path

Allow

/news/*

Allow

/photo/*

Allow

/amp/*

bingbot

Rule	Path
Disallow

Rule

Path

Disallow

proximic

Rule	Path
Disallow

Rule

Path

Disallow

Back to top

Other Records

Field	Value
sitemap	https://www.andhrajyothy.com/sitemap.xml

Field

Value

sitemap

https://www.andhrajyothy.com/sitemap.xml

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html

Back to top

andhrajyothi.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

mediapartners-google

googlebot

google

*

bingbot

proximic

Other Records

Comments

andhrajyothi.com
robots.txt