pt.linuxteaching.com
robots.txt

Robots Exclusion Standard data for pt.linuxteaching.com

Resource Scan

Scan Details

Site Domain pt.linuxteaching.com
Base Domain linuxteaching.com
Scan Status Ok
Last Scan2025-10-15T04:49:03+00:00
Next Scan 2025-11-14T04:49:03+00:00

Last Scan

Scanned2025-10-15T04:49:03+00:00
URL https://pt.linuxteaching.com/robots.txt
Domain IPs 104.21.58.172, 172.67.162.79, 2606:4700:3031::6815:3aac, 2606:4700:3032::ac43:a24f
Response IP 172.67.162.79
Found Yes
Hash 85b1ee4c88b74c8a12fcf6fa194948e0d150d922d85d5b87dc6702793ef18247
SimHash 4c645c45c590

Groups

*

Rule Path
Disallow /admin

googlebot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
sitemap https://pt.linuxteaching.com/sitemap.xml

Warnings

  • `host` is not a known field.