51cto.com
robots.txt

Robots Exclusion Standard data for 51cto.com

Resource Scan

Scan Details

Site Domain 51cto.com
Base Domain 51cto.com
Scan Status Ok
Last Scan 2024-06-23T16:31:09+00:00
Next Scan 2024-06-30T16:31:09+00:00

Last Scan

Scanned 2024-06-23T16:31:09+00:00
URL https://51cto.com/robots.txt
Domain IPs 203.107.44.140
Response IP 203.107.44.140
Found Yes
Hash 405c6bcd564d143fca974f9a14ecb211534a4ab0965dac490ee47e3c514e6a64
SimHash 3a498ec4eb91

Groups

*

Rule Path
Disallow /assets/
Disallow /static/
Disallow /plugin/
Disallow /tinymce/
Disallow /_ctoweb/
Disallow /php*
Disallow /*?*
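
To illustrate how the rules in the `*` group apply to concrete paths, here is a minimal sketch in Python. It assumes the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end of the path, and every pattern is matched from the start of the path). The helper names `rule_to_regex` and `is_disallowed` are hypothetical and are not part of the scan tool. A custom matcher is used because Python's standard `urllib.robotparser` does not interpret `*` wildcards. Since the group contains no Allow rules, no precedence logic between Allow and Disallow is needed here.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW_RULES = [
    "/assets/",
    "/static/",
    "/plugin/",
    "/tinymce/",
    "/_ctoweb/",
    "/php*",
    "/*?*",
]


def rule_to_regex(rule: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumed semantics: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; everything else is matched literally.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning, which gives prefix matching.
    return any(rule_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/static/js/app.js", "/phpinfo", "/article?id=1", "/article/123"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/*?*` blocks any URL that carries a query string, and `/php*` blocks any path beginning with `/php`, while a plain path such as `/article/123` remains crawlable.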

Other Records

Field Value
sitemap https://www.51cto.com/sitemap/google/index.xml
sitemap https://www.51cto.com/sitemap/google/index_detail.xml
sitemap https://www.51cto.com/sitemap/google/list.xml
sitemap https://www.51cto.com/sitemap/google/others.xml
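
The records above can be rendered back into robots.txt syntax. The sketch below is only a plausible reconstruction: the original file's exact ordering, comments, capitalization, and whitespace are not shown in this report, so the output is not expected to reproduce the hash listed under Last Scan. The function name `render_robots_txt` is hypothetical.

```python
# Parsed records from this report: one group for user-agent "*",
# plus the sitemap entries listed under "Other Records".
GROUPS = {
    "*": [
        ("Disallow", "/assets/"),
        ("Disallow", "/static/"),
        ("Disallow", "/plugin/"),
        ("Disallow", "/tinymce/"),
        ("Disallow", "/_ctoweb/"),
        ("Disallow", "/php*"),
        ("Disallow", "/*?*"),
    ],
}

SITEMAPS = [
    "https://www.51cto.com/sitemap/google/index.xml",
    "https://www.51cto.com/sitemap/google/index_detail.xml",
    "https://www.51cto.com/sitemap/google/list.xml",
    "https://www.51cto.com/sitemap/google/others.xml",
]


def render_robots_txt(groups, sitemaps) -> str:
    """Render parsed records as robots.txt text (hypothetical helper)."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{field}: {path}" for field, path in rules)
        lines.append("")  # blank line closes the group
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, SITEMAPS), end="")
```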