blog.web20classroom.org
robots.txt

Robots Exclusion Standard data for blog.web20classroom.org

Resource Scan

Scan Details

Site Domain blog.web20classroom.org
Base Domain web20classroom.org
Scan Status Ok
Last Scan 2024-05-31T06:05:30+00:00
Next Scan 2024-06-30T06:05:30+00:00

Last Scan

Scanned 2024-05-31T06:05:30+00:00
URL https://blog.web20classroom.org/robots.txt
Domain IPs 2404:6800:4003:c01::79, 74.125.130.121
Response IP 172.217.194.121
Found Yes
Hash ebddeed7b34fae5e864a9f6d141bdce2e9fb3d3d79a4d0c0e3e692434e3340e8
SimHash 6b0492404792

Groups

mediapartners-google

Rule Path
Disallow (empty)

*

Rule Path
Disallow /search
Allow /

Other Records

Field Value
sitemap https://blog.web20classroom.org/sitemap.xml
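As a rough illustration of what the groups and sitemap record above mean for a crawler, the sketch below rebuilds a robots.txt from the scanned fields and queries it with Python's standard urllib.robotparser. The reconstructed file text is an assumption (the scan shows only parsed rules, not the raw file), and the agent name "SomeBot" and the post path are hypothetical. Note also that Python's parser applies the first matching rule in file order, which may differ from crawlers that use longest-match precedence.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the scan's Groups and Other Records.
# The raw robots.txt is not shown in the report, so line order is a guess.
ROBOTS_TXT = """\
User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /search
Allow: /

Sitemap: https://blog.web20classroom.org/sitemap.xml
"""

BASE = "https://blog.web20classroom.org"

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical URLs and agents used only to exercise the rules.
checks = [
    ("SomeBot", BASE + "/search?q=edtech"),
    ("SomeBot", BASE + "/2024/05/post.html"),
    ("mediapartners-google", BASE + "/search?q=edtech"),
]

for agent, url in checks:
    verdict = "allowed" if parser.can_fetch(agent, url) else "blocked"
    print(f"{agent:22} {url} -> {verdict}")

# site_maps() requires Python 3.8+ and returns None if no Sitemap line exists.
print("sitemaps:", parser.site_maps())
```

Under these assumptions, a generic crawler is kept out of /search but may fetch everything else, while mediapartners-google has an empty Disallow and is therefore unrestricted.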