webchick.com
robots.txt
Robots Exclusion Standard data for webchick.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | webchick.com |
Base Domain | webchick.com |
Scan Status | Ok |
Last Scan | 2025-10-14T16:43:18+00:00 |
Next Scan | 2025-11-13T16:43:18+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-10-14T16:43:18+00:00 |
URL | https://webchick.com/robots.txt |
Redirect | https://www.webchick.com/robots.txt |
Redirect Domain | www.webchick.com |
Redirect Base | webchick.com |
Domain IPs | 72.167.48.191 |
Redirect IPs | 72.167.48.191 |
Response IP | 72.167.48.191 |
Found | Yes |
Hash | af3bc9af1a1a054d4aa7eee74c272a0b88d7799a15c90226caa3c09f02d579b8 |
SimHash | 3706cc447497 |
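The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the sketch below is only an assumption: it treats the value as a SHA-256 digest of the raw response body and shows how a freshly fetched copy could be compared against it. The function names `body_sha256` and `matches_recorded` are hypothetical.

```python
import hashlib

# Hash value copied from the scan record above.
RECORDED_HASH = "af3bc9af1a1a054d4aa7eee74c272a0b88d7799a15c90226caa3c09f02d579b8"


def body_sha256(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Check a body against the recorded hash.

    Assumption: the scanner hashed the unmodified response bytes with
    SHA-256. If it normalised the text first, this will not match.
    """
    return body_sha256(body) == recorded.lower()
```

A mismatch would not necessarily mean the file changed; it may only mean the scanner hashes something other than the raw body.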
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /anon_ftp |
Disallow | /cgi-bin |
Disallow | /conf |
Disallow | /error_docs |
Disallow | /etc |
Disallow | /img |
Disallow | /picture_library |
Disallow | /pd |
Disallow | /private |
Disallow | /statistics |
Disallow | /subdomains |
Disallow | /web_users |
Other Records
Field | Value |
---|---|
sitemap | https://www.webchick.com/sitemap.xml |
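To see how a crawler might apply the records above, here is a minimal sketch that rebuilds an equivalent robots.txt from the listed paths and checks a few URLs with Python's standard `urllib.robotparser`. The rebuilt text is a reconstruction, not the original file (its exact formatting is not shown here, so it will not reproduce the recorded hash), and the sample URLs and the helper names `build_robots_txt` and `make_parser` are made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Paths and sitemap copied from the tables above.
DISALLOWED = [
    "/anon_ftp", "/cgi-bin", "/conf", "/error_docs", "/etc", "/img",
    "/picture_library", "/pd", "/private", "/statistics",
    "/subdomains", "/web_users",
]
SITEMAP = "https://www.webchick.com/sitemap.xml"


def build_robots_txt(paths, sitemap):
    """Rebuild an equivalent robots.txt: one '*' group plus a sitemap line."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in paths]
    lines += ["", f"Sitemap: {sitemap}"]
    return "\n".join(lines) + "\n"


def make_parser(text):
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = make_parser(build_robots_txt(DISALLOWED, SITEMAP))

# Hypothetical URLs; only the path part is matched against the rules.
for url in (
    "https://www.webchick.com/private/report.html",
    "https://www.webchick.com/imgs/logo.png",
    "https://www.webchick.com/about",
):
    print(url, parser.can_fetch("*", url))
```

The standard-library parser matches rules by simple path prefix, so `Disallow: /img` also covers a path such as `/imgs/logo.png`. Other crawlers may interpret the same rules slightly differently.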
Comments