awakentherebel.com
robots.txt

Robots Exclusion Standard data for awakentherebel.com

Resource Scan

Scan Details

Site Domain awakentherebel.com
Base Domain awakentherebel.com
Scan Status Ok
Last Scan2026-04-03T03:53:33+00:00
Next Scan 2026-05-03T03:53:33+00:00

Last Scan

Scanned2026-04-03T03:53:33+00:00
URL https://awakentherebel.com/robots.txt
Domain IPs 104.21.35.162, 172.67.177.162, 2606:4700:3033::ac43:b1a2, 2606:4700:3035::6815:23a2
Response IP 172.67.177.162
Found Yes
Hash 7323386b710fdb133d1d0415f2dbdb690855ac3cc357567985ba89c2637a2cd6
SimHash 0a345971e771

Groups

*

Rule Path
Disallow /search
Disallow /admin
Disallow /search?*
Disallow /search?search=
Disallow /*.pdf$
Disallow /?
Disallow /*?page=
Disallow /cgi-bin*
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://awakentherebel.com/sitemap.xml