thequint.com
robots.txt

Robots Exclusion Standard data for thequint.com

Resource Scan

Scan Details

Site Domain thequint.com
Base Domain thequint.com
Scan Status Ok
Last Scan2024-09-21T08:43:15+00:00
Next Scan 2024-09-28T08:43:15+00:00

Last Scan

Scanned2024-09-21T08:43:15+00:00
URL https://thequint.com/robots.txt
Redirect https://www.thequint.com/robots.txt
Redirect Domain www.thequint.com
Redirect Base thequint.com
Domain IPs 23.20.179.164, 54.158.195.16
Redirect IPs 104.18.90.198, 104.18.91.198, 104.18.92.198, 104.18.93.198, 104.18.94.198, 2606:4700::6812:5ac6, 2606:4700::6812:5bc6, 2606:4700::6812:5cc6, 2606:4700::6812:5dc6, 2606:4700::6812:5ec6
Response IP 104.18.91.198
Found Yes
Hash 8cf9ea458b6eac2ee803191dd764b803f0cf92d4b3b2d2d0afb583d80f924f04
SimHash 485f8cc02fbb

Groups

*

Rule Path
Allow /plan-selection
Disallow /story/
Disallow /preview/
Disallow /static/
Disallow /api/auth/
Disallow /api/access/
Disallow /*?prerender=true
Disallow /thequint/assets/
Disallow /api/v1/

gptbot

Rule Path
Disallow /

semrush

Rule Path
Disallow /

ahref

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

claude

Rule Path
Disallow /

open ai

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thequint.com/sitemap/today
sitemap https://www.thequint.com/sitemap/yesterday