turtl.co
robots.txt

Robots Exclusion Standard data for turtl.co

Resource Scan

Scan Details

Site Domain turtl.co
Base Domain turtl.co
Scan Status Ok
Last Scan 2025-08-23T05:05:24+00:00
Next Scan 2025-09-22T05:05:24+00:00

Last Scan

Scanned 2025-08-23T05:05:24+00:00
URL https://turtl.co/robots.txt
Domain IPs 199.60.103.146, 199.60.103.46
Response IP 199.60.103.146
Found Yes
Hash 9a413bb88fc30b08be9ac15d9fee76e0fed1099a4ae9d6977f282e90643d3cee
SimHash 2808d5f1c5b3

Groups

*

Rule Path
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
Disallow */tag/
Disallow /blog/author/*?
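
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way such patterns could be checked against a URL path. It assumes the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the sample paths are made up for illustration. Because this group contains only Disallow rules, no Allow/Disallow precedence logic is included.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW_RULES = [
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
    "/*?*hs_preview=*",
    "/*?*hsCacheBuster=*",
    "*/tag/",
    "/blog/author/*?",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the match at the end of the path. Everything
    else is matched literally, as a prefix of the path.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any Disallow pattern matches the path (with query)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    # Hypothetical paths, not taken from the scan.
    for sample in ["/blog/tag/marketing", "/blog/author/jane", "/blog/author/jane?page=2"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, `*/tag/` blocks any path containing `/tag/`, while `/blog/author/*?` only blocks author pages that carry a query string, so a plain author page remains crawlable.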

Other Records

Field Value
sitemap https://turtl.co/sitemap.xml