preview.therealdeal.com
robots.txt

Robots Exclusion Standard data for preview.therealdeal.com

Resource Scan

Scan Details

Site Domain preview.therealdeal.com
Base Domain therealdeal.com
Scan Status Ok
Last Scan 2024-05-22T09:23:35+00:00
Next Scan 2024-06-05T09:23:35+00:00

Last Scan

Scanned 2024-05-22T09:23:35+00:00
URL https://preview.therealdeal.com/robots.txt
Redirect https://therealdeal.com/robots.txt
Redirect Domain therealdeal.com
Redirect Base therealdeal.com
Domain IPs 104.22.12.16, 104.22.13.16, 172.67.14.92, 2606:4700:10::6816:c10, 2606:4700:10::6816:d10, 2606:4700:10::ac43:e5c
Redirect IPs 13.33.30.101, 13.33.30.89, 13.33.30.92, 13.33.30.96, 2600:9000:260e:600:0:bcc9:f180:93a1, 2600:9000:260e:6e00:0:bcc9:f180:93a1, 2600:9000:260e:7200:0:bcc9:f180:93a1, 2600:9000:260e:7400:0:bcc9:f180:93a1, 2600:9000:260e:8c00:0:bcc9:f180:93a1, 2600:9000:260e:9400:0:bcc9:f180:93a1, 2600:9000:260e:9800:0:bcc9:f180:93a1, 2600:9000:260e:b200:0:bcc9:f180:93a1
Response IP 13.33.30.89
Found Yes
Hash b2fa7ffdc852844c23e217c4338ba9ca4ded44f46cbf0ab3cdbc3875adf83c97
SimHash 63061052c8a3
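
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest of the fetched robots.txt body, although the report does not name the algorithm. As a minimal sketch under that assumption, a scanner could compare digests between scans to detect changes. The function names `body_fingerprint` and `has_changed` are hypothetical and are not part of any scanner's documented API.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's Hash field is a SHA-256 digest; the
    report itself does not say which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """True when the freshly fetched body no longer matches the stored digest."""
    return body_fingerprint(body) != previous_hash


if __name__ == "__main__":
    # Illustrative body only, not the real file contents.
    sample = b"User-agent: *\nDisallow: /wp-admin/\n"
    stored = body_fingerprint(sample)
    print(len(stored))                       # digest length in hex digits
    print(has_changed(stored, sample))       # same body
    print(has_changed(stored, sample + b"\n"))  # body with one extra byte
```

An exact digest flags any byte-level change. The separate SimHash field suggests the scanner also tracks near-duplicate content, which this sketch does not attempt.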

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-*.php
Disallow /wp-content/uploads/**/*.xlsx
Disallow /?p=*
Disallow /new-york?p=*
Disallow /sanfrancisco?p=*
Disallow /national?p=*
Disallow /miami?p=*
Disallow /la?p=*
Disallow /chicago?p=*
Disallow /?locale=*
Disallow /?altu=*
Allow *
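
The rules for the * group rely on wildcard paths, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way such rules could be evaluated, assuming the matching semantics of RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and individual crawlers may treat the non-standard `Allow *` line differently.

```python
import re

# Rules copied from the "*" group of the scan report.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-*.php"),
    ("disallow", "/wp-content/uploads/**/*.xlsx"),
    ("disallow", "/?p=*"),
    ("disallow", "/new-york?p=*"),
    ("disallow", "/sanfrancisco?p=*"),
    ("disallow", "/national?p=*"),
    ("disallow", "/miami?p=*"),
    ("disallow", "/la?p=*"),
    ("disallow", "/chicago?p=*"),
    ("disallow", "/?locale=*"),
    ("disallow", "/?altu=*"),
    ("allow", "*"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a prefix-matching regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; Allow wins ties; no match means allowed."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in ["/wp-admin/options.php", "/?p=123", "/miami?p=5",
                 "/wp-content/uploads/2024/05/report.xlsx",
                 "/new-york/2024/05/story/"]:
        print(path, is_allowed(path))
```

Under these assumptions, the one-character `Allow *` pattern is always the shortest match, so it only takes effect for paths that no Disallow rule covers.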

twitterbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

facebookcatalog

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20
sitemap https://therealdeal.com/sitemap_index.xml