annehogan.net
robots.txt

Robots Exclusion Standard data for annehogan.net

Resource Scan

Scan Details

Site Domain annehogan.net
Base Domain annehogan.net
Scan Status Ok
Last Scan 4/3/2025, 4:09:07 PM
Next Scan 5/3/2025, 4:09:07 PM

Last Scan

Scanned 4/3/2025, 4:09:07 PM
URL https://annehogan.net/robots.txt
Domain IPs 104.21.49.114, 172.67.162.6, 2606:4700:3033::6815:3172, 2606:4700:3033::ac43:a206
Response IP 104.21.49.114
Found Yes
Hash 91fcda7e01e30c672c3f49aa3b9732d31510a4c3bb14577a77da06b704e5209d
SimHash 6bb9b9628a02
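
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so treating it as SHA-256 is an assumption. Under that assumption, a minimal sketch of checking a locally fetched copy against the reported value could look like the following; the function names `body_digest` and `matches_report` are hypothetical.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "91fcda7e01e30c672c3f49aa3b9732d31510a4c3bb14577a77da06b704e5209d"


def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of the raw response body.

    Assumption: the report hashes the unmodified bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """True when the body hashes to the value shown in the report."""
    return body_digest(body) == REPORTED_HASH
```

A mismatch would not necessarily mean the file changed: the scanner might normalize line endings or hash a decoded string rather than raw bytes.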

Groups

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

riddler

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

detectify

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

flipboardproxy

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
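
Each of the twelve named groups above carries a single "Disallow /" rule, which asks that crawler to stay out of the whole site. The sketch below rebuilds those groups as robots.txt text and checks them with Python's standard `urllib.robotparser`. The report does not show the original file's exact formatting, so the rebuilt text is an approximation, and `ExampleBot` is a made-up agent name used only for contrast.

```python
from urllib.robotparser import RobotFileParser

# Agent names exactly as listed in the report (lowercased there).
BLOCKED_AGENTS = [
    "semrushbot", "mj12bot", "riddler", "aihitbot", "trovitbot", "detectify",
    "blexbot", "dotbot", "flipboardproxy", "rogerbot", "megaindex.ru", "gptbot",
]


def build_robots_txt(agents):
    """Rebuild one 'Disallow: /' group per agent, separated by blank lines."""
    return "\n".join(f"User-agent: {agent}\nDisallow: /\n" for agent in agents)


def make_parser(text):
    """Feed robots.txt text to the standard-library parser."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = make_parser(build_robots_txt(BLOCKED_AGENTS))

# A listed crawler is refused everywhere; an unlisted one is not affected
# by these groups (the "*" group is deliberately left out of this sketch).
gptbot_ok = parser.can_fetch("GPTBot/1.0", "https://annehogan.net/any/page")
other_ok = parser.can_fetch("ExampleBot", "https://annehogan.net/any/page")
```

Agent matching in the standard-library parser is case-insensitive, so the lowercased names from the report still match user-agent strings such as "GPTBot/1.0".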

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /feed
Disallow */feed
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-json/
Disallow /xmlrpc.php
Disallow /readme.html
Allow /wp-includes/*.css
Allow /wp-includes/*.js
Allow /wp-content/plugins/*.css
Allow /wp-content/plugins/*.js
Allow /*.css
Allow /*.js
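
The "*" group mixes Disallow and Allow rules that use the "*" wildcard and the "$" end anchor, so the outcome for a given path depends on which rule wins. A minimal sketch of one common interpretation follows: the longest matching pattern wins, and Allow wins a tie, as described in RFC 9309. This is a hand-rolled matcher for illustration, not the scanner's or any crawler's actual code. It compares raw path strings and does not normalize percent-encoding, and patterns that begin with "*" rather than "/" are handled as crawlers commonly do, which is an assumption.

```python
import re

# Rules copied from the "*" group in the report, in order.
RULES = [
    ("disallow", "/cgi-bin"), ("disallow", "/wp-admin/"), ("disallow", "/?"),
    ("disallow", "*?s="), ("disallow", "*%26s%3D"), ("disallow", "/search"),
    ("disallow", "*/embed$"), ("disallow", "*/xmlrpc.php"),
    ("disallow", "*utm*%3D"), ("disallow", "*openstat%3D"),
    ("disallow", "/feed"), ("disallow", "*/feed"), ("disallow", "/wp-includes"),
    ("disallow", "/wp-content/plugins"), ("disallow", "/wp-content/cache"),
    ("disallow", "/wp-json/"), ("disallow", "/xmlrpc.php"),
    ("disallow", "/readme.html"),
    ("allow", "/wp-includes/*.css"), ("allow", "/wp-includes/*.js"),
    ("allow", "/wp-content/plugins/*.css"), ("allow", "/wp-content/plugins/*.js"),
    ("allow", "/*.css"), ("allow", "/*.js"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):  # match() anchors at the start
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under this interpretation, "/wp-includes/js/jquery.js" is allowed because "/wp-includes/*.js" is longer than "/wp-includes", while "/wp-admin/options.php" stays disallowed because no Allow pattern matches it.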

Other Records

Field Value
sitemap https://annehogan.net/sitemap.xml
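
The sitemap record sits outside any user-agent group and applies to all crawlers. As a small sketch, Python's standard `urllib.robotparser` exposes such records through `site_maps()` (available from Python 3.8). The single input line below is rebuilt from the report, not copied from the original file.

```python
from urllib.robotparser import RobotFileParser

# One-line robots.txt fragment rebuilt from the "Other Records" table.
SITEMAP_LINE = "Sitemap: https://annehogan.net/sitemap.xml"

parser = RobotFileParser()
parser.parse([SITEMAP_LINE])

# site_maps() returns a list of sitemap URLs, or None when there are none.
sitemaps = parser.site_maps()
```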