inwcat.com
robots.txt
Robots Exclusion Standard data for inwcat.com
Resource Scan
Scan Details
Site Domain | inwcat.com |
Base Domain | inwcat.com |
Scan Status | Ok |
Last Scan | 2024-09-23T06:13:28+00:00 |
Next Scan | 2024-09-30T06:13:28+00:00 |
Last Scan
Scanned | 2024-09-23T06:13:28+00:00 |
URL | https://inwcat.com/robots.txt |
Domain IPs | 104.21.12.155, 172.67.132.46, 2606:4700:3031::6815:c9b, 2606:4700:3033::ac43:842e |
Response IP | 104.21.12.155 |
Found | Yes |
Hash | 03b064c165977419f19868a394cde88c2c6293f96d429031ca5dcd3d4711b8f1 |
SimHash | 2d225e362230 |
Groups
*
Rule | Path |
---|---|
Allow | /$ |
Allow | /*.xml |
Allow | /*sitemap |
Disallow | /admin |
Allow | /*board*.html$ |
Allow | /*topic*.html$ |
Allow | /profile* |
Allow | /tags* |
Disallow | /Packages/ |
Disallow | /Smileys/ |
Disallow | /Sources/ |
Disallow | /Themes/ |
Disallow | /*PHPSESSID |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |
Other Records
Field | Value |
---|---|
sitemap | https://www.inwcat.com/sitemap.xml |
sitemap | http://www.inwcat.com/sitemap_mobile.xml |