1023therose.com
robots.txt

Robots Exclusion Standard data for 1023therose.com

Resource Scan

Scan Details

Site Domain 1023therose.com
Base Domain 1023therose.com
Scan Status Ok
Last Scan2024-05-12T19:52:05+00:00
Next Scan 2024-05-19T19:52:05+00:00

Last Scan

Scanned2024-05-12T19:52:05+00:00
URL https://1023therose.com/robots.txt
Redirect https://www.1023therose.com/?robots=1
Redirect Domain www.1023therose.com
Redirect Base 1023therose.com
Domain IPs 104.21.85.69, 172.67.203.28, 2606:4700:3030::ac43:cb1c, 2606:4700:3033::6815:5545
Redirect IPs 104.21.85.69, 172.67.203.28, 2606:4700:3030::ac43:cb1c, 2606:4700:3033::6815:5545
Response IP 104.21.85.69
Found Yes
Hash 02f194e315964c4ce64c8b7212bfdd992623b5ebee5cbf5064c259a3813faabf
SimHash 62011c908f95

Groups

twitterbot

Rule Path
Disallow /calendar/action*
Disallow /events/action*
Disallow /wp-admin
Disallow /wp-login.php
Allow /wp-admin/admin-ajax.php
Allow /wp-content/themes/blacklab/assets/*?
Allow /syndicated-article/*?

Other Records

Field Value
crawl-delay 30

facebookexternalhit

Rule Path
Allow *

*

Rule Path
Disallow /calendar/action*
Disallow /events/action*
Disallow /wp-admin
Disallow /syndicated-article/*?
Disallow /wp-login.php
Allow /wp-admin/admin-ajax.php
Allow /wp-content/themes/blacklab/assets/*?

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap https://www.1023therose.com/sitemap.xml