Robots Exclusion Standard data for webguy.io

Resource Scan

Scan Details

Site Domain webguy.io
Base Domain webguy.io
Scan Status Ok
Last Scan 2025-12-17T17:30:44+00:00
Next Scan 2025-12-24T17:30:44+00:00

Last Scan

Scanned 2025-12-17T17:30:44+00:00
URL https://webguy.io/robots.txt
Domain IPs 104.21.88.125, 172.67.179.94, 2606:4700:3036::ac43:b35e, 2606:4700:3037::6815:587d
Response IP 172.67.179.94
Found Yes
Hash d8b15744b08be1992ee7b8ba0aabc27a0d341c48dfa88df432aceffb9d3414b2
SimHash 8905dcf2c773

Groups

*

Rule Path
Allow /
Disallow /projects
Disallow /*~*
Disallow /*~

ia_archiver

Rule Path
Disallow /
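
To illustrate how the groups above might be applied to a request path, here is a minimal sketch in Python. It assumes the longest-match rule from RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), treats `*` as a wildcard and a trailing `$` as an end anchor, and selects a group by exact user-agent token. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan data; real crawlers may interpret these rules differently.

```python
import re

# Rules transcribed from the Groups listing above.
# Keys are user-agent tokens; values are (rule, path pattern) pairs.
GROUPS = {
    "*": [
        ("allow", "/"),
        ("disallow", "/projects"),
        ("disallow", "/*~*"),
        ("disallow", "/*~"),
    ],
    "ia_archiver": [
        ("disallow", "/"),
    ],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` under GROUPS.

    Assumption: group selection is an exact, case-insensitive token lookup
    that falls back to the '*' group.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longest pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, a generic crawler may fetch `/about` but not `/projects/x` or any path containing `~` (such as an editor backup file `/notes.txt~`), while `ia_archiver` is excluded from the whole site. Note that `/projects` is a prefix match, so it would also cover a path like `/projects-old`.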

Other Records

Field Value
sitemap https://webguy.io/sitemap.xml