greatexpectationsrealty.com
robots.txt

Robots Exclusion Standard data for greatexpectationsrealty.com

Resource Scan

Scan Details

Site Domain greatexpectationsrealty.com
Base Domain greatexpectationsrealty.com
Scan Status Ok
Last Scan2026-02-04T10:07:13+00:00
Next Scan 2026-02-18T10:07:13+00:00

Last Scan

Scanned2026-02-04T10:07:13+00:00
URL https://greatexpectationsrealty.com/robots.txt
Domain IPs 104.21.11.189, 172.67.192.99, 2606:4700:3035::6815:bbd, 2606:4700:3037::ac43:c063
Response IP 172.67.192.99
Found Yes
Hash 4d0b9e5c2b8b682460fe86809922156a32c1e44a1335a401a4bda93df5c0b6a9
SimHash 68455c825882

Groups

*

Rule Path
Allow /wp-content/uploads/
Disallow /wp-content/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /cgi-bin/
Allow /feed/$
Disallow /feed
Disallow /comments/feed
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /?s=
Disallow /search
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.wmv$
Disallow /*.cgi$
Disallow /*.xhtml$

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

claritybot

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://greatexpectationsrealty.com/sitemap_index.xml