comarcanoroeste.com
robots.txt

Robots Exclusion Standard data for comarcanoroeste.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	comarcanoroeste.com
Base Domain	comarcanoroeste.com
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-11-27T12:31:11+00:00
Next Scan	2026-02-25T12:31:11+00:00

Last Successful Scan

Scanned	2025-07-29T23:50:34+00:00
URL	https://comarcanoroeste.com/robots.txt
Domain IPs	85.208.102.225
Response IP	85.208.102.225
Found	Yes
Hash	7d834064d9d3265a59e6433ff6bce99154b5f4736c0d2217bf438bbbbfd04f98
SimHash	bc10bd4be174

Groups

*

Rule	Path
Disallow	/config/
Disallow	/system/
Disallow	/themes/
Disallow	/vendor/
Disallow	/cache/
Disallow	/encuesta/
Disallow	/changelog.txt
Disallow	/composer.json
Disallow	/composer.lock
Disallow	/composer.phar
Disallow	/search/
Disallow	/admin/
Allow	/themes/*/css/
Allow	/themes/*/images/
Allow	/themes/*/img/
Allow	/themes/*/js/
Allow	/themes/*/fonts/
Allow	/content/images/*.jpg
Allow	/content/images/*.png
Allow	/content/images/*.gif

Rule

Path

Disallow

/config/

Disallow

/system/

Disallow

/themes/

Disallow

/vendor/

Disallow

/cache/

Disallow

/encuesta/

Disallow

/changelog.txt

Disallow

/composer.json

Disallow

/composer.lock

Disallow

/composer.phar

Disallow

/search/

Disallow

/admin/

Allow

/themes/*/css/

Allow

/themes/*/images/

Allow

/themes/*/img/

Allow

/themes/*/js/

Allow

/themes/*/fonts/

Allow

/content/images/*.jpg

Allow

/content/images/*.png

Allow

/content/images/*.gif

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/wc/robots.html
For syntax checking, see:
http://www.sxw.org.uk/computing/robots/check.html
Disallow directories
Disallow files
Disallow paths
Allow themes
Allow content images

Back to top

comarcanoroeste.comrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Comments

comarcanoroeste.com
robots.txt