inwcat.com
robots.txt

Robots Exclusion Standard data for inwcat.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	inwcat.com
Base Domain	inwcat.com
Scan Status	Ok
Last Scan	2024-09-23T06:13:28+00:00
Next Scan	2024-09-30T06:13:28+00:00

Last Scan

Scanned	2024-09-23T06:13:28+00:00
URL	https://inwcat.com/robots.txt
Domain IPs	104.21.12.155, 172.67.132.46, 2606:4700:3031::6815:c9b, 2606:4700:3033::ac43:842e
Response IP	104.21.12.155
Found	Yes
Hash	03b064c165977419f19868a394cde88c2c6293f96d429031ca5dcd3d4711b8f1
SimHash	2d225e362230

Groups

googlebot-image

Rule	Path
Allow	/

Rule

Path

Allow

/

yandeximages

Rule	Path
Allow	/

Rule

Path

Allow

/

msnbot-mm

Rule	Path
Allow	/

Rule

Path

Allow

/

googlebot-mobile

Rule	Path
Allow	/

Rule

Path

Allow

/

yandeximageresizer

Rule	Path
Allow	/

Rule

Path

Allow

/

mediapartners-google

Rule	Path
Allow	/

Rule

Path

Allow

/

*

Rule	Path
Allow	/$
Allow	/*.xml
Allow	/*sitemap
Disallow	/admin
Allow	/board.html$
Allow	/topic.html$
Allow	/profile*
Allow	/tags*
Disallow	/Packages/
Disallow	/Smileys/
Disallow	/Sources/
Disallow	/Themes/
Disallow	/*PHPSESSID

Rule

Path

Allow

/$

Allow

/*.xml

Allow

/*sitemap

Disallow

/admin

Allow

/*board*.html$

Allow

/*topic*.html$

Allow

/profile*

Allow

/tags*

Disallow

/Packages/

Disallow

/Smileys/

Disallow

/Sources/

Disallow

/Themes/

Disallow

/*PHPSESSID

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

5

Back to top

Other Records

Field	Value
sitemap	https://www.inwcat.com/sitemap.xml
sitemap	http://www.inwcat.com/sitemap_mobile.xml

Field

Value

sitemap

https://www.inwcat.com/sitemap.xml

sitemap

http://www.inwcat.com/sitemap_mobile.xml

Back to top

inwcat.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

googlebot-image

yandeximages

msnbot-mm

googlebot-mobile

yandeximageresizer

mediapartners-google

*

Other Records

Other Records

inwcat.com
robots.txt