cs.linuxteaching.com
robots.txt

Robots Exclusion Standard data for cs.linuxteaching.com

Resource Scan

Scan Details

Site Domain cs.linuxteaching.com
Base Domain linuxteaching.com
Scan Status Ok
Last Scan2025-11-02T06:54:39+00:00
Next Scan 2025-12-02T06:54:39+00:00

Last Scan

Scanned2025-11-02T06:54:39+00:00
URL https://cs.linuxteaching.com/robots.txt
Domain IPs 104.21.58.172, 172.67.162.79, 2606:4700:3031::6815:3aac, 2606:4700:3032::ac43:a24f
Response IP 172.67.162.79
Found Yes
Hash 26a98349ad38e3d2e83583d3d1a443540e66c254147f43d094121c86d3a1f982
SimHash 4c255c4505d1

Groups

*

Rule Path
Disallow /admin

googlebot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
sitemap https://cs.linuxteaching.com/sitemap.xml

Warnings

  • `host` is not a known field.