w3cschool.cn
robots.txt

Robots Exclusion Standard data for w3cschool.cn

Resource Scan

Scan Details

Site Domain w3cschool.cn
Base Domain w3cschool.cn
Scan Status Ok
Last Scan 2025-11-03T22:08:14+00:00
Next Scan 2025-11-10T22:08:14+00:00

Last Scan

Scanned 2025-11-03T22:08:14+00:00
URL https://w3cschool.cn/robots.txt
Redirect https://www.w3cschool.cn/robots.txt
Redirect Domain www.w3cschool.cn
Redirect Base w3cschool.cn
Domain IPs 120.79.88.157
Redirect IPs 47.106.199.63
Response IP 47.106.199.63
Found Yes
Hash 122d133fd7fe43384592c7e2c46e9e8c62a955f34611ef1c4554d7f97a8cc4ac
SimHash 1c234a457e10
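
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not state the algorithm or which bytes were hashed, so the sketch below is an assumption: it hashes a raw response body with SHA-256 and compares the result to the recorded value. The function names `body_digest` and `matches_report` are hypothetical, and the SimHash field is not covered.

```python
import hashlib

# Hash recorded in the scan details above.
REPORTED_HASH = "122d133fd7fe43384592c7e2c46e9e8c62a955f34611ef1c4554d7f97a8cc4ac"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a freshly fetched body against the hash stored in the report."""
    return body_digest(body) == reported.lower()
```

If the scanner normalizes the file before hashing (for example, line endings or encoding), a byte-for-byte fetch of https://www.w3cschool.cn/robots.txt may not reproduce the recorded value even when the rules are unchanged.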

Groups

mediapartners-google

Rule Path
Allow /

*

Rule Path
Disallow /channel/
Disallow /data/
Disallow /u/
Disallow /edit/
Disallow /include/
Disallow /project/
Disallow /register?*
Disallow /login?*
Disallow /logout?*
Disallow /explore?*
Disallow /search?*
Disallow /project/list?*
Disallow /article?*tpl=article2
Disallow /api/
Disallow /training/
Disallow /training/*
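
To illustrate how the two groups above apply to a request path, here is a minimal sketch of a matcher. It assumes the common interpretation of robots.txt rules: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and user-agent selection is simplified to an exact, case-insensitive lookup with a fallback to `*`.

```python
import re

# Rules transcribed from the Groups section; keys are user-agent tokens.
GROUPS = {
    "mediapartners-google": [("allow", "/")],
    "*": [
        ("disallow", "/channel/"),
        ("disallow", "/data/"),
        ("disallow", "/u/"),
        ("disallow", "/edit/"),
        ("disallow", "/include/"),
        ("disallow", "/project/"),
        ("disallow", "/register?*"),
        ("disallow", "/login?*"),
        ("disallow", "/logout?*"),
        ("disallow", "/explore?*"),
        ("disallow", "/search?*"),
        ("disallow", "/project/list?*"),
        ("disallow", "/article?*tpl=article2"),
        ("disallow", "/api/"),
        ("disallow", "/training/"),
        ("disallow", "/training/*"),
    ],
}


def pattern_to_regex(path):
    """Translate a rule path into a regex anchored at the start of the URL path."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape literal parts; each '*' becomes '.*'.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, url_path):
    """Return True if url_path may be fetched by user_agent under GROUPS."""
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, allowed = -1, True  # no matching rule means allowed
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            longer = len(path) > best_len
            tie_allow = len(path) == best_len and kind == "allow"
            if longer or tie_allow:
                best_len, allowed = len(path), kind == "allow"
    return allowed
```

Under these assumptions, `is_allowed("Googlebot", "/api/users")` is False because the `*` group disallows `/api/`, while the same path is permitted for `mediapartners-google`, whose only rule is `Allow /`. A path such as `/search` without a query string matches no rule, since the pattern `/search?*` requires a literal `?`.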

Comments

  • robots.txt for w3cschool
  • Version 1.1.8 2020.02.08