therealdeal.net
robots.txt

Robots Exclusion Standard data for therealdeal.net

Resource Scan

Scan Details

Site Domain therealdeal.net
Base Domain therealdeal.net
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-15T14:14:24+00:00
Next Scan 2024-12-14T14:14:24+00:00

Last Successful Scan

Scanned 2024-08-17T14:07:11+00:00
URL http://therealdeal.net/robots.txt
Redirect https://therealdeal.com/robots.txt
Redirect Domain therealdeal.com
Redirect Base therealdeal.com
Domain IPs 209.17.116.163
Redirect IPs 13.33.30.101, 13.33.30.89, 13.33.30.92, 13.33.30.96, 2600:9000:25fb:200:0:bcc9:f180:93a1, 2600:9000:25fb:2600:0:bcc9:f180:93a1, 2600:9000:25fb:4400:0:bcc9:f180:93a1, 2600:9000:25fb:7400:0:bcc9:f180:93a1, 2600:9000:25fb:8800:0:bcc9:f180:93a1, 2600:9000:25fb:a400:0:bcc9:f180:93a1, 2600:9000:25fb:b800:0:bcc9:f180:93a1, 2600:9000:25fb:f000:0:bcc9:f180:93a1
Response IP 13.33.30.101
Found Yes
Hash b2fa7ffdc852844c23e217c4338ba9ca4ded44f46cbf0ab3cdbc3875adf83c97
SimHash 63061052c8a3

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-*.php
Disallow /wp-content/uploads/**/*.xlsx
Disallow /?p=*
Disallow /new-york?p=*
Disallow /sanfrancisco?p=*
Disallow /national?p=*
Disallow /miami?p=*
Disallow /la?p=*
Disallow /chicago?p=*
Disallow /?locale=*
Disallow /?altu=*
Allow *
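
The rules in the `*` group rely on `*` wildcards and on precedence between `Allow` and `Disallow`. The sketch below shows one way a crawler might evaluate them, assuming the longest-match convention described in RFC 9309 (the most specific matching pattern wins, and `Allow` wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and only a subset of the rules above is included for brevity.

```python
import re

# A subset of the "*" group listed above, as (rule, path pattern) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-*.php"),
    ("disallow", "/wp-content/uploads/**/*.xlsx"),
    ("disallow", "/?p=*"),
    ("disallow", "/new-york?p=*"),
    ("allow", "*"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path):
    """Return True if `path` (path plus query string) may be fetched.

    Assumption: longest matching pattern wins; Allow wins a tie.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # With no matching rule, the path is allowed by default.
    return True if best is None else best[1]

print(is_allowed(RULES, "/wp-admin/options.php"))    # False
print(is_allowed(RULES, "/new-york?p=123"))          # False
print(is_allowed(RULES, "/new-york/some-article/"))  # True
```

Under this convention the one-character `Allow *` pattern matches every path but loses to any longer `Disallow` pattern that also matches, so it acts only as a default.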

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

facebookcatalog

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20
sitemap https://therealdeal.com/sitemap_index.xml
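
As a rough illustration of how these two records might be read programmatically, the sketch below feeds a reconstructed fragment to Python's standard `urllib.robotparser`. The fragment's layout is an assumption: the scan data does not show which group the `crawl-delay` record belongs to, so it is placed under `User-agent: *` here purely for illustration. The user agent name `examplebot` is also hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed layout: the report does not say which group carries crawl-delay,
# so this fragment attaches it to the "*" group for illustration only.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 20
Sitemap: https://therealdeal.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "examplebot" is a hypothetical crawler name with no group of its own,
# so it falls back to the "*" group.
delay = parser.crawl_delay("examplebot")
sitemaps = parser.site_maps()  # available in Python 3.8 and later

print(delay)
print(sitemaps)
```

Note that `urllib.robotparser` does not interpret `*` wildcards inside paths, so it is suitable for reading these records but not for evaluating the wildcard rules in the `*` group.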