warehaus.com
robots.txt

Robots Exclusion Standard data for warehaus.com

Resource Scan

Scan Details

Site Domain warehaus.com
Base Domain warehaus.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-08-29T15:26:46+00:00
Next Scan 2024-11-27T15:26:46+00:00

Last Successful Scan

Scanned 2023-08-06T11:37:29+00:00
URL https://www.warehaus.com/robots.txt
Domain IPs 2600:9000:2003:5600:8:48e3:a780:93a1, 2600:9000:2003:8e00:8:48e3:a780:93a1, 2600:9000:2003:ba00:8:48e3:a780:93a1, 2600:9000:2003:bc00:8:48e3:a780:93a1, 2600:9000:2003:c800:8:48e3:a780:93a1, 2600:9000:2003:d200:8:48e3:a780:93a1, 2600:9000:2003:da00:8:48e3:a780:93a1, 2600:9000:2003:e800:8:48e3:a780:93a1, 54.192.150.105, 54.192.150.120, 54.192.150.26, 54.192.150.74
Response IP 54.192.150.120
Found Yes
Hash c36038f68d31ed8131910e30c74bf61cafed0157138de412d1697facdbca96dd
SimHash 2400d0c05592

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /api
Disallow /click_ad
Disallow /clk_ad
Disallow /wr_clk

adsbot-google

Rule Path
Disallow
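
The groups above can be checked with Python's standard `urllib.robotparser`. The sketch below is a minimal illustration, not the site's actual file: the robots.txt text is reconstructed from the reported groups (the original wording, ordering, and comments are not shown in this report), `ExampleBot` is a hypothetical crawler name, and the helper names `build_parser` and `is_allowed` are introduced only for this example. A `Disallow` rule with an empty path places no restriction on that user agent.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups listed in this report; the real
# file's exact text, ordering, and comments are not shown here.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: Mediapartners-Google
Disallow:

User-agent: *
Disallow: /api
Disallow: /click_ad
Disallow: /clk_ad
Disallow: /wr_clk

User-agent: AdsBot-Google
Disallow:
"""

# Host taken from the "URL" field of the last successful scan.
BASE_URL = "https://www.warehaus.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text directly, without fetching anything."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, user_agent: str, path: str) -> bool:
    """Return True if user_agent may fetch BASE_URL + path under the rules."""
    return parser.can_fetch(user_agent, BASE_URL + path)


if __name__ == "__main__":
    parser = build_parser(RECONSTRUCTED_ROBOTS_TXT)
    # "ExampleBot" is a hypothetical crawler that falls under the "*" group.
    for agent in ("Mediapartners-Google", "AdsBot-Google", "ExampleBot"):
        for path in ("/", "/api", "/click_ad"):
            print(agent, path, is_allowed(parser, agent, path))
```

Under this parser, the two Google ad crawlers are allowed everywhere because their groups contain only an empty `Disallow`, while any other agent is blocked from paths that start with `/api`, `/click_ad`, `/clk_ad`, or `/wr_clk`. Matching is by path prefix, so a path such as `/api/items` is also blocked for the `*` group.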

Comments

  • DWNDSO-2922: SEM Campaign Addition