guyisa.com
robots.txt

Robots Exclusion Standard data for guyisa.com

Resource Scan

Scan Details

Site Domain guyisa.com
Base Domain guyisa.com
Scan Status Ok
Last Scan2025-10-12T18:02:31+00:00
Next Scan 2025-10-19T18:02:31+00:00

Last Scan

Scanned2025-10-12T18:02:31+00:00
URL https://guyisa.com/robots.txt
Domain IPs 104.18.8.146
Response IP 104.18.8.146
Found Yes
Hash c38ca4f7cec091921e9ce0aa1e5e8daafc7af19d4a8db6eae5bf446b9ceb7ea8
SimHash 287d02f26e81

Groups

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

researchscan

Rule Path
Disallow /

dnyzbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

uptimerobot

Rule Path
Disallow /

yunsecuritybot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

powermarks

Rule Path
Disallow /

linguee

Rule Path
Disallow /

*

Rule Path
Disallow /inc/
Allow /static/js/
Allow /static/css/
Allow /static/themes/
Allow /static/themes-v2/
Disallow /static/
Disallow /account/
Disallow /tmp/
Disallow /ajax/
Disallow /cdn-cgi/
Disallow /v-code/

Other Records

Field Value
sitemap https://guyisa.com/guyisa-com-sitemap.xml