caprocat.com
robots.txt

Robots Exclusion Standard data for caprocat.com

Resource Scan

Scan Details

Site Domain	caprocat.com
Base Domain	caprocat.com
Scan Status	Ok
Last Scan	2024-05-29T03:04:10+00:00
Next Scan	2024-06-28T03:04:10+00:00

Last Scan

Scanned	2024-05-29T03:04:10+00:00
URL	https://caprocat.com/robots.txt
Domain IPs	18.161.6.23, 18.161.6.61, 18.161.6.82, 18.161.6.98
Response IP	108.157.52.119
Found	Yes
Hash	bdd072c4f9251f3e77f8b72ebcda4ef93946dd6bf53e2331a26e07b681f3b02c
SimHash	6b0c4878c338
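
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption (and assuming the digest covers the raw response body), a locally fetched copy of robots.txt could be compared against the reported value like this. The function name is hypothetical.

```python
import hashlib

# Hash value copied from the Last Scan table above.
REPORTED_HASH = "bdd072c4f9251f3e77f8b72ebcda4ef93946dd6bf53e2331a26e07b681f3b02c"


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `reported`.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest() == reported.lower()
```

A mismatch does not necessarily mean the file changed; it may only mean the scanner uses a different algorithm or normalizes the body before hashing.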

Groups

*

Rule	Path
Allow	/*.css$
Allow	/*.js$
Disallow	/*/?slug=*
Disallow	/*/*/?slug=*
Disallow	/wp-login
Disallow	/*/feed/
Disallow	/*/trackback/
Disallow	/*/attachment/
Disallow	/?attachment_id*
Disallow	/comments/
Disallow	/xmlrpc.php
Disallow	/*?s=
Disallow	/?s=*
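
To illustrate how the patterns in this group apply to concrete paths, here is a minimal sketch of a matcher. It assumes the wildcard semantics described in RFC 9309 and used by major crawlers: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The helper names are hypothetical, and individual crawlers may resolve conflicts differently.

```python
import re

# Rules from the "*" group above, in the order listed.
RULES = [
    ("allow", "/*.css$"),
    ("allow", "/*.js$"),
    ("disallow", "/*/?slug=*"),
    ("disallow", "/*/*/?slug=*"),
    ("disallow", "/wp-login"),
    ("disallow", "/*/feed/"),
    ("disallow", "/*/trackback/"),
    ("disallow", "/*/attachment/"),
    ("disallow", "/?attachment_id*"),
    ("disallow", "/comments/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/*?s="),
    ("disallow", "/?s=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow); the largest tuple wins
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/wp-login.php")` and `is_allowed("/blog/feed/")` return False, while `is_allowed("/assets/site.css")` and `is_allowed("/about/")` return True.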

Other Records

Field	Value
crawl-delay	5
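
The crawl-delay field is not part of RFC 9309, and crawlers that honor it commonly read the value as a number of seconds to wait between successive requests. A minimal sketch of a throttle built on that interpretation follows. The class name and its injectable `clock` and `sleep` parameters are hypothetical choices for illustration.

```python
import time

# Value from the crawl-delay record above, read as seconds (an assumption).
CRAWL_DELAY_SECONDS = 5


class Throttle:
    """Space successive requests at least `delay` seconds apart."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock      # injectable so the logic can be tested
        self.sleep = sleep
        self.last_request = None

    def wait(self):
        """Block until the delay has elapsed; return the seconds slept."""
        waited = 0.0
        if self.last_request is not None:
            remaining = self.delay - (self.clock() - self.last_request)
            if remaining > 0:
                self.sleep(remaining)
                waited = remaining
        self.last_request = self.clock()
        return waited
```

A crawler would call `Throttle(CRAWL_DELAY_SECONDS).wait()` before each request to the site.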

Other Records

Field	Value
sitemap	https://caprocat.com/sitemap_index.xml
