Robots Exclusion Standard data for webs.com.gt

Resource Scan

Scan Details

Site Domain webs.com.gt
Base Domain webs.com.gt
Scan Status Ok
Last Scan 2026-01-05T19:14:14+00:00
Next Scan 2026-01-12T19:14:14+00:00

Last Scan

Scanned 2026-01-05T19:14:14+00:00
URL https://webs.com.gt/robots.txt
Domain IPs 104.21.60.242, 172.67.202.199, 2606:4700:3032::6815:3cf2, 2606:4700:3037::ac43:cac7
Response IP 104.21.60.242
Found Yes
Hash 2369888365fb5bd2d82bb3920d6fa3c8edb02d22982a499eda92631f3bec8aa3
SimHash a55e19408adb
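The 64-character hexadecimal Hash value is consistent with a SHA-256 digest of the fetched file, although the scan does not state which algorithm was used. A minimal sketch of computing such a digest, assuming SHA-256 over the raw response bytes (the function name `content_hash` is hypothetical):

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm the algorithm.
    """
    return hashlib.sha256(body).hexdigest()
```

Comparing the digest of a freshly fetched file with the stored value would show whether the file changed between scans, provided the assumption about the algorithm holds.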

Groups

*

Rule Path
Disallow /wp-admin/*
Disallow /wp-login.php
Disallow /wp-content/themes/*
Disallow /wp-content/plugins/*
Disallow /trackback
Disallow /cgi-bin
Disallow /users/
Disallow */trackback
Disallow */rss
Disallow */embed
Disallow /xmlrpc.php
Disallow *utm%3D
Disallow *openstat%3D
Disallow /readme.html
Disallow /?*
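The Disallow rules above can be checked against a URL path with a small matcher. The following is a sketch only, assuming the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end, and a rule otherwise matches as a prefix). The helper names are hypothetical, and details such as percent-encoding normalization and Allow precedence are left out.

```python
import re

# Disallow patterns copied from the "*" group in the scan above.
DISALLOW = [
    "/wp-admin/*",
    "/wp-login.php",
    "/wp-content/themes/*",
    "/wp-content/plugins/*",
    "/trackback",
    "/cgi-bin",
    "/users/",
    "*/trackback",
    "*/rss",
    "*/embed",
    "/xmlrpc.php",
    "*utm%3D",
    "*openstat%3D",
    "/readme.html",
    "/?*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" in place of each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path (with query string)."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

Under these assumptions, `is_disallowed("/wp-admin/options.php")` and `is_disallowed("/?s=foo")` return True, while `is_disallowed("/about")` returns False.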

Other Records

Field Value
sitemap https://webs.com.gt/post-sitemap.xml

Warnings

  • 1 invalid line.