routledge.com
robots.txt

Robots Exclusion Standard data for routledge.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	routledge.com
Base Domain	routledge.com
Scan Status	Ok
Last Scan	2024-10-29T22:27:24+00:00
Next Scan	2024-11-28T22:27:24+00:00

Last Scan

Scanned	2024-10-29T22:27:24+00:00
URL	https://routledge.com/robots.txt
Redirect	https://www.routledge.com/robots.txt
Redirect Domain	www.routledge.com
Redirect Base	routledge.com
Domain IPs	104.17.184.26, 104.17.185.26
Redirect IPs	104.17.184.26, 104.17.185.26
Response IP	104.17.185.26
Found	Yes
Hash	d1ab205e75471a1db4bfe757c4decf67bbe4e4a79bed23570034800cce0cbafb
SimHash	25354c14dc52

Groups

*

Rule	Path
Disallow	/account
Disallow	/wishlist
Disallow	/cart
Disallow	/c/
Disallow	/cw/
Disallow	/cdn-cgi/

Rule

Path

Disallow

/account

Disallow

/wishlist

Disallow

/cart

Disallow

/c/

Disallow

/cw/

Disallow

/cdn-cgi/

ahrefsbot

Rule	Path
Disallow	/
Disallow	/cdn-cgi/

Rule

Path

Disallow

/

Disallow

/cdn-cgi/

Back to top

Other Records

Field	Value
sitemap	https://www.routledge.com/sitemap_index.xml

Field

Value

sitemap

https://www.routledge.com/sitemap_index.xml

Back to top

Comments

Disallow select URLs
Sitemaps-https

Back to top

routledge.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

ahrefsbot

Other Records

Comments

routledge.com
robots.txt