inwcat.com
robots.txt

Robots Exclusion Standard data for inwcat.com

Resource Scan

Scan Details

Site Domain inwcat.com
Base Domain inwcat.com
Scan Status Ok
Last Scan 2024-09-23T06:13:28+00:00
Next Scan 2024-09-30T06:13:28+00:00

Last Scan

Scanned 2024-09-23T06:13:28+00:00
URL https://inwcat.com/robots.txt
Domain IPs 104.21.12.155, 172.67.132.46, 2606:4700:3031::6815:c9b, 2606:4700:3033::ac43:842e
Response IP 104.21.12.155
Found Yes
Hash 03b064c165977419f19868a394cde88c2c6293f96d429031ca5dcd3d4711b8f1
SimHash 2d225e362230

Groups

googlebot-image

Rule Path
Allow /

yandeximages

Rule Path
Allow /

msnbot-mm

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

yandeximageresizer

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

*

Rule Path
Allow /$
Allow /*.xml
Allow /*sitemap
Disallow /admin
Allow /*board*.html$
Allow /*topic*.html$
Allow /profile*
Allow /tags*
Disallow /Packages/
Disallow /Smileys/
Disallow /Sources/
Disallow /Themes/
Disallow /*PHPSESSID
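
The rules in the `*` group mix `*` wildcards and `$` end anchors, so it is not obvious which rule wins for a given path. The sketch below is one way to evaluate them, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers, not part of the scan data, and individual crawlers may interpret these rules differently.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "/$"),
    ("allow", "/*.xml"),
    ("allow", "/*sitemap"),
    ("disallow", "/admin"),
    ("allow", "/*board*.html$"),
    ("allow", "/*topic*.html$"),
    ("allow", "/profile*"),
    ("allow", "/tags*"),
    ("disallow", "/Packages/"),
    ("disallow", "/Smileys/"),
    ("disallow", "/Sources/"),
    ("disallow", "/Themes/"),
    ("disallow", "/*PHPSESSID"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: longest matching pattern wins; Allow wins ties;
    a path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for verdict, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            longer = len(pattern) > best_len
            tie_prefers_allow = len(pattern) == best_len and allow
            if longer or tie_prefers_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ["/", "/admin/login", "/Themes/default/style.css",
                   "/index.php?PHPSESSID=abc", "/index.php?board=1.0"]:
        print(sample, is_allowed(sample))
```

Under this reading, a path such as `/index.php?board=1.0` matches no rule and is allowed by default, while a longer Allow pattern such as `/*board*.html$` overrides the shorter `Disallow /admin` for a path like `/admin/board1.html`.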

Other Records

Field Value
crawl-delay 5
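
The `crawl-delay` field is not part of RFC 9309, and some crawlers ignore it; those that honor it usually read the value as a minimum number of seconds between requests. The following is a minimal sketch of that interpretation. The class `PoliteFetcher` and its `clock` and `sleep` parameters are hypothetical and exist only so the timing logic can be exercised without real waiting.

```python
import time

CRAWL_DELAY_SECONDS = 5  # value of the crawl-delay record above


class PoliteFetcher:
    """Spaces out requests by at least `delay` seconds (assumed meaning of crawl-delay)."""

    def __init__(self, delay=CRAWL_DELAY_SECONDS,
                 clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock          # injectable for testing
        self.sleep = sleep          # injectable for testing
        self.last_request = None    # time of the previous request, if any

    def wait_turn(self):
        """Block until the delay has elapsed; return the seconds waited."""
        waited = 0.0
        if self.last_request is not None:
            remaining = self.delay - (self.clock() - self.last_request)
            if remaining > 0:
                self.sleep(remaining)
                waited = remaining
        self.last_request = self.clock()
        return waited
```

A crawler would call `wait_turn()` immediately before each request to the site, so the first request goes out at once and later ones are held back only as long as needed.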

Other Records

Field Value
sitemap https://www.inwcat.com/sitemap.xml
sitemap http://www.inwcat.com/sitemap_mobile.xml