curiosmos.com
robots.txt

Robots Exclusion Standard data for curiosmos.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	curiosmos.com
Base Domain	curiosmos.com
Scan Status	Ok
Last Scan	2024-11-08T14:07:14+00:00
Next Scan	2024-11-15T14:07:14+00:00

Last Scan

Scanned	2024-11-08T14:07:14+00:00
URL	https://curiosmos.com/robots.txt
Domain IPs	104.21.91.188, 172.67.177.214, 2606:4700:3030::6815:5bbc, 2606:4700:3033::ac43:b1d6
Response IP	104.21.91.188
Found	Yes
Hash	1c0ae54e28385889ad3872de18c49e2ffd2325f5d9f1c8dcda6b922792b9b2d3
SimHash	0088c9605e38

Groups

*

Rule	Path
Disallow	/dev
Disallow	/dev/
Disallow	/cgi-bin
Disallow	/wp-
Disallow	/author/
Disallow	*?attachment_id=
Allow	/wp-content/uploads/
Allow	/wp-content/themes/
Allow	//.js
Allow	//.css
Allow	/wp-*.png
Allow	/wp-*.jpg
Allow	/wp-*.jpeg
Allow	/wp-*.gif
Allow	/wp-*.svg
Allow	/wp-*.pdf

Rule

Path

Disallow

/dev

Disallow

/dev/

Disallow

/cgi-bin

Disallow

/wp-

Disallow

/author/

Disallow

*?attachment_id=

Allow

/wp-content/uploads/

Allow

/wp-content/themes/

Allow

/*/*.js

Allow

/*/*.css

Allow

/wp-*.png

Allow

/wp-*.jpg

Allow

/wp-*.jpeg

Allow

/wp-*.gif

Allow

/wp-*.svg

Allow

/wp-*.pdf

Back to top

Other Records

Field	Value
sitemap	https://curiosmos.com/sitemap_index.xml

Field

Value

sitemap

https://curiosmos.com/sitemap_index.xml

Back to top

curiosmos.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

curiosmos.com
robots.txt