epmmop.gob.ec
robots.txt

Robots Exclusion Standard data for epmmop.gob.ec

Resource Scan

Scan Details

Site Domain epmmop.gob.ec
Base Domain epmmop.gob.ec
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2024-05-14T03:45:19+00:00
Next Scan 2024-05-28T03:45:19+00:00

Last Successful Scan

Scanned2024-04-06T02:29:58+00:00
URL https://epmmop.gob.ec/robots.txt
Domain IPs 186.46.83.251
Response IP 186.46.83.251
Found Yes
Hash 8d7589769b64f0829430e63a075603fcdabef2f726b6c911f130f8f626261633
SimHash 220cc5794184

Groups

*

Rule Path
Allow /*.js***************
Allow /*.css**************
Allow /*.png**************
Allow /*.jpg**************
Allow /*.jpeg**************
Allow /*.gif**************
Allow /*.eot**************
Allow /*.woff**************
Allow /*.ttf**************
Allow /*.svg**************
Allow /*.otf**************
Allow /*.pdf**************
Allow /*.PNG**************
Allow /*.JPG**************
Allow /*.JPEG**************
Allow /*.mp3**************
Allow /*.pdf**************
Disallow /administrator/
Disallow /cache/
Disallow /cdn/
Disallow /cgi-bin/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /lists/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /t3-assets/
Disallow /tmp/
Disallow /videos/

Comments

  • Please don't remove folders from disallow.
  • The allows at the top allow any of the mimetypes listed to be crawled within any folder
  • using long-tail wildcards, these ignore the disallows for the folders below.
  • This gives full render for the search engines whilst preventing full crawls of system
  • folders
  • THIS ALLOWS FULL RENDER AT ENGINES
  • THESE FOLDERS SHOULD NEVER BE CRAWLED