wconline.com
robots.txt
Robots Exclusion Standard data for wconline.com
Resource Scan
Scan Details
Site Domain | wconline.com |
Base Domain | wconline.com |
Scan Status | Ok |
Last Scan | 2024-11-16T19:05:40+00:00 |
Next Scan | 2024-11-23T19:05:40+00:00 |
Last Scan
Scanned | 2024-11-16T19:05:40+00:00 |
URL | https://wconline.com/robots.txt |
Redirect | https://www.wconline.com/robots.txt |
Redirect Domain | www.wconline.com |
Redirect Base | wconline.com |
Domain IPs | 104.21.77.24, 172.67.203.143, 2606:4700:3032::ac43:cb8f, 2606:4700:3033::6815:4d18 |
Redirect IPs | 104.21.77.24, 172.67.203.143, 2606:4700:3032::ac43:cb8f, 2606:4700:3033::6815:4d18 |
Response IP | 172.67.203.143 |
Found | Yes |
Hash | a6cb850a1bbf869db6ed0f85e09ab86ef8dc3fdd16b0e20c88a275393c49c15b |
SimHash | eb9c0f1d5672 |
Groups
*
Rule | Path |
---|---|
Disallow | /comments/flag/ |
Disallow | /search |
Disallow | /articles/comment/abuse |
Disallow | /articles/email |
Disallow | /articles/preview |
Disallow | /articles/print |
Disallow | /products/email |
Disallow | /products/print |
Disallow | /cart |
Disallow | /user/* |
Disallow | /*/log_view |
Disallow | /query/* |
Disallow | /media/video/ |
Allow | /media/videos/play/* |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
Other Records
Field | Value |
---|---|
sitemap | https://www.wconline.com/sitemap.xml |
Comments