myosceolalibrary.org
robots.txt

Robots Exclusion Standard data for myosceolalibrary.org

Resource Scan

Scan Details

Site Domain myosceolalibrary.org
Base Domain myosceolalibrary.org
Scan Status Ok
Last Scan2025-04-24T08:49:36+00:00
Next Scan 2025-05-24T08:49:36+00:00

Last Scan

Scanned2025-04-24T08:49:36+00:00
URL https://myosceolalibrary.org/robots.txt
Redirect https://www.myosceolalibrary.org/robots.txt
Redirect Domain www.myosceolalibrary.org
Redirect Base myosceolalibrary.org
Domain IPs 104.21.52.29, 172.67.194.156, 2606:4700:3031::6815:341d, 2606:4700:3032::ac43:c29c
Redirect IPs 104.21.52.29, 172.67.194.156, 2606:4700:3031::6815:341d, 2606:4700:3032::ac43:c29c
Response IP 172.67.194.156
Found Yes
Hash a80a47355646753f4ad696dd3ccd62bfc8d402d5d39135ff7d58e417b0629abe
SimHash 48645e82d792

Groups

*

Rule Path
Disallow /comments/feed
Disallow /feed/$
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /trackback/
Disallow /wp-admin/
Disallow /*.inc$
Disallow */trackback/

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.myosceolalibrary.org/sitemap_index.xml