guatushe.com
robots.txt

Robots Exclusion Standard data for guatushe.com

Resource Scan

Scan Details

Site Domain guatushe.com
Base Domain guatushe.com
Scan Status Ok
Last Scan2026-01-20T13:43:48+00:00
Next Scan 2026-02-19T13:43:48+00:00

Last Scan

Scanned2026-01-20T13:43:48+00:00
URL https://guatushe.com/robots.txt
Redirect https://www.guatushe.com/robots.txt
Redirect Domain www.guatushe.com
Redirect Base guatushe.com
Domain IPs 206.119.179.64
Redirect IPs 206.119.179.64
Response IP 206.119.179.64
Found Yes
Hash 09d3d6930944f4b3382d81514891fd35e3bbc281fd60cc0f9c1a82ef61f0bb39
SimHash d15dd8c08e8a

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-include/
Disallow /xui/
Disallow /tmui/
Disallow /label/
Disallow /seeyon/
Disallow /Inc/
Disallow /*.zip
Disallow /*.rar
Disallow /*.asp
Disallow /*.ico
Disallow /*.aspx
Disallow /*?*
Disallow /*feed*
Disallow /wp-json/

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

hubspot

Rule Path
Disallow /

leiki

Rule Path
Disallow /

webmeup

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

tracking bot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

awariobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.guatushe.com/sitemap.xml