chron.com
robots.txt

Robots Exclusion Standard data for chron.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	chron.com
Base Domain	chron.com
Scan Status	Ok
Last Scan	2024-10-31T13:18:30+00:00
Next Scan	2024-11-07T13:18:30+00:00

Last Scan

Scanned	2024-10-31T13:18:30+00:00
URL	https://chron.com/robots.txt
Redirect	https://www.chron.com/robots.txt
Redirect Domain	www.chron.com
Redirect Base	chron.com
Domain IPs	98.129.228.59
Redirect IPs	151.101.0.200, 151.101.128.200, 151.101.192.200, 151.101.64.200
Response IP	199.232.44.200
Found	Yes
Hash	6b3ae2610e3cd22f05f58ff21a393439b4ca0ea51b57d1d7932f56c7dcfe322e
SimHash	cc3e405782d2

Groups

*

Rule	Path
Disallow	/style/beauty/hearstmagazines/
Disallow	/style/fashion/hearstmagazines/
Disallow	/living/relationships/hearstmagazines/
Disallow	/homeandgarden/home/hearstmagazines/
Disallow	/living/wellness/hearstmagazines/
Disallow	/adtest
Disallow	/sponsored
Disallow	/events/
Disallow	/search

Rule

Path

Disallow

/style/beauty/hearstmagazines/

Disallow

/style/fashion/hearstmagazines/

Disallow

/living/relationships/hearstmagazines/

Disallow

/homeandgarden/home/hearstmagazines/

Disallow

/living/wellness/hearstmagazines/

Disallow

/adtest

Disallow

/sponsored

Disallow

/events/

Disallow

/search

googlebot-news

Rule	Path
Disallow	/business/press-releases

Rule

Path

Disallow

/business/press-releases

ccbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/413gkwMT/

Rule

Path

Disallow

/413gkwMT/

applebot-extended

Rule	Path
Disallow	/private/

Rule

Path

Disallow

/private/

Back to top

Other Records

Field	Value
sitemap	https://www.chron.com/sitemap.xml
sitemap	https://www.chron.com/sitemap_news.xml

Field

Value

sitemap

https://www.chron.com/sitemap.xml

sitemap

https://www.chron.com/sitemap_news.xml

Back to top

chron.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

googlebot-news

ccbot

*

applebot-extended

Other Records

chron.com
robots.txt