thetradedesk.com
robots.txt

Robots Exclusion Standard data for thetradedesk.com

Resource Scan

Scan Details

Site Domain thetradedesk.com
Base Domain thetradedesk.com
Scan Status Ok
Last Scan 2024-11-09T11:12:19+00:00
Next Scan 2024-11-23T11:12:19+00:00
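
The two timestamps above are 14 days apart. As a minimal sketch, the interval can be checked with Python's standard library; the two-week value is only an observation from this one record, not a documented scan schedule, and the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Scan Details listing above.
last_scan = datetime.fromisoformat("2024-11-09T11:12:19+00:00")
next_scan = datetime.fromisoformat("2024-11-23T11:12:19+00:00")

# Interval between the two scans (observed, not a documented policy).
interval = next_scan - last_scan
print(interval.days)
```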

Last Scan

Scanned 2024-11-09T11:12:19+00:00
URL https://thetradedesk.com/robots.txt
Redirect https://www.thetradedesk.com/robots.txt
Redirect Domain www.thetradedesk.com
Redirect Base thetradedesk.com
Domain IPs 13.33.88.127, 13.33.88.68, 13.33.88.88, 13.33.88.98, 2600:9000:223b:1600:1a:9ca:c700:93a1, 2600:9000:223b:4a00:1a:9ca:c700:93a1, 2600:9000:223b:6e00:1a:9ca:c700:93a1, 2600:9000:223b:8a00:1a:9ca:c700:93a1, 2600:9000:223b:9c00:1a:9ca:c700:93a1, 2600:9000:223b:a200:1a:9ca:c700:93a1, 2600:9000:223b:c400:1a:9ca:c700:93a1, 2600:9000:223b:e00:1a:9ca:c700:93a1
Redirect IPs 104.22.36.93, 104.22.37.93, 172.67.7.135, 2606:4700:10::6816:245d, 2606:4700:10::6816:255d, 2606:4700:10::ac43:787
Response IP 172.67.7.135
Found Yes
Hash 32387885f02a747daaa715280ce589a5429aedb487c3095b3aec52ce3fb8b1bc
SimHash 090cc9528513

Groups

*

Rule Path
Allow /
Disallow /assets/global/documents-noindex/*
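
The two rules above overlap, since every path matches `Allow /`. The sketch below shows how a crawler following the longest-match rule of RFC 9309 would likely resolve them: the most specific (longest) matching pattern wins, and `Allow` wins ties. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and the sample paths are made up for illustration; they are not taken from the site.

```python
import re

# Rules for the "*" group, copied from the listing above.
RULES = [
    ("allow", "/"),
    ("disallow", "/assets/global/documents-noindex/*"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match semantics."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = (kind == "allow")
    return allowed

# Hypothetical paths, used only to exercise the rules.
print(is_allowed("/"))
print(is_allowed("/assets/global/documents-noindex/example.pdf"))
print(is_allowed("/assets/global/documents/example.pdf"))
```

Python's built-in `urllib.robotparser` is not used here because it applies rules in file order and does not expand `*` in paths, so it would let `Allow /` shadow the `Disallow` line in this group.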

Other Records

Field Value
sitemap https://www.thetradedesk.com/sitemap.xml