therealdeal.com
robots.txt

Robots Exclusion Standard data for therealdeal.com

Resource Scan

Scan Details

Site Domain therealdeal.com
Base Domain therealdeal.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-14T02:56:50+00:00
Next Scan 2024-10-14T02:56:50+00:00

Last Successful Scan

Scanned2024-08-16T02:55:28+00:00
URL https://therealdeal.com/robots.txt
Domain IPs 13.33.30.101, 13.33.30.89, 13.33.30.92, 13.33.30.96, 2600:9000:229f:1400:0:bcc9:f180:93a1, 2600:9000:229f:5800:0:bcc9:f180:93a1, 2600:9000:229f:600:0:bcc9:f180:93a1, 2600:9000:229f:a800:0:bcc9:f180:93a1, 2600:9000:229f:c800:0:bcc9:f180:93a1, 2600:9000:229f:d800:0:bcc9:f180:93a1, 2600:9000:229f:f200:0:bcc9:f180:93a1, 2600:9000:229f:fc00:0:bcc9:f180:93a1
Response IP 13.33.30.89
Found Yes
Hash b2fa7ffdc852844c23e217c4338ba9ca4ded44f46cbf0ab3cdbc3875adf83c97
SimHash 63061052c8a3

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-*.php
Disallow /wp-content/uploads/**/*.xlsx
Disallow /?p=*
Disallow /new-york?p=*
Disallow /sanfrancisco?p=*
Disallow /national?p=*
Disallow /miami?p=*
Disallow /la?p=*
Disallow /chicago?p=*
Disallow /?locale=*
Disallow /?altu=*
Allow *

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

facebookcatalog

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://therealdeal.com/sitemap_index.xml