thetradedesk.com
robots.txt

Robots Exclusion Standard data for thetradedesk.com

Resource Scan

Scan Details

Site Domain thetradedesk.com
Base Domain thetradedesk.com
Scan Status Ok
Last Scan 2024-07-06T11:09:54+00:00
Next Scan 2024-07-20T11:09:54+00:00

Last Scan

Scanned 2024-07-06T11:09:54+00:00
URL https://thetradedesk.com/robots.txt
Redirect https://www.thetradedesk.com/robots.txt
Redirect Domain www.thetradedesk.com
Redirect Base thetradedesk.com
Domain IPs 13.35.18.116, 13.35.18.55, 13.35.18.76, 13.35.18.9, 2600:9000:20c7:5e00:1a:9ca:c700:93a1, 2600:9000:20c7:7000:1a:9ca:c700:93a1, 2600:9000:20c7:7400:1a:9ca:c700:93a1, 2600:9000:20c7:7800:1a:9ca:c700:93a1, 2600:9000:20c7:9e00:1a:9ca:c700:93a1, 2600:9000:20c7:aa00:1a:9ca:c700:93a1, 2600:9000:20c7:b000:1a:9ca:c700:93a1, 2600:9000:20c7:dc00:1a:9ca:c700:93a1
Redirect IPs 104.22.36.93, 104.22.37.93, 172.67.7.135, 2606:4700:10::6816:245d, 2606:4700:10::6816:255d, 2606:4700:10::ac43:787
Response IP 172.67.7.135
Found Yes
Hash 32387885f02a747daaa715280ce589a5429aedb487c3095b3aec52ce3fb8b1bc
SimHash 090cc9528513

Groups

*

Rule Path
Allow /
Disallow /assets/global/documents-noindex/*
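
To illustrate how the two rules in the * group interact, here is a minimal Python sketch. It assumes the longest-match precedence described in RFC 9309: the most specific (longest) matching pattern wins, and an Allow wins a tie. The names RULES, pattern_to_regex and is_allowed are hypothetical helpers for this example and are not part of the scan data or of any library. Individual crawlers may interpret the rules differently.

```python
import re

# Rules from the "*" group listed above, as (directive, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/assets/global/documents-noindex/*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal characters; '*' stands for any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # A path that matches no rule is allowed.
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # The longer pattern wins; on equal length, prefer allow.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for p in ("/", "/assets/global/documents-noindex/report.pdf"):
        print(p, is_allowed(p))
```

Under these assumptions, every path is crawlable except those below /assets/global/documents-noindex/, because the Disallow pattern is longer than Allow / and therefore takes precedence where both match. The file name report.pdf is a made-up example path.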

Other Records

Field Value
sitemap https://www.thetradedesk.com/sitemap.xml