inesem.es
robots.txt

Robots Exclusion Standard data for inesem.es

Resource Scan

Scan Details

Site Domain inesem.es
Base Domain inesem.es
Scan Status Ok
Last Scan 2024-05-22T16:47:14+00:00
Next Scan 2024-06-21T16:47:14+00:00

Last Scan

Scanned 2024-05-22T16:47:14+00:00
URL https://inesem.es/robots.txt
Redirect https://www.inesem.es/robots.txt
Redirect Domain www.inesem.es
Redirect Base inesem.es
Domain IPs 188.40.221.164
Redirect IPs 23.44.4.209, 2600:1413:1::48f7:7feb, 2600:1413:1::7d38:db4b
Response IP 42.99.140.185
Found Yes
Hash bf071cbc6f05027f78256a6756bc3abf0f8cc0ddfb050de7f970029f7cf1279d
SimHash b9d4bd5b1742

Groups

*

Rule Path
Allow /core/*.css$
Allow /core/*.css?
Allow /core/*.js$
Allow /core/*.js?
Allow /core/*.gif
Allow /core/*.jpg
Allow /core/*.jpeg
Allow /core/*.png
Allow /core/*.svg
Allow /profiles/*.css$
Allow /profiles/*.css?
Allow /profiles/*.js$
Allow /profiles/*.js?
Allow /profiles/*.gif
Allow /profiles/*.jpg
Allow /profiles/*.jpeg
Allow /profiles/*.png
Allow /profiles/*.svg
Allow /revistadigital/wp-content/uploads/*
Allow /revistadigital/wp-content/*.js$
Allow /revistadigital/wp-content/*.js?*
Allow /revistadigital/wp-content/*.css$
Allow /revistadigital/wp-content/*.css?*
Allow /revistadigital/wp-content/*.jpg$
Allow /revistadigital/wp-content/*.png$
Allow /revistadigital/wp-content/*.svg$
Allow /revistadigital/wp-includes/*.js$
Allow /revistadigital/wp-includes/*.css$
Allow /revistadigital/wp-json/
Disallow /revistadigital/revistadigital/cgi-bin
Disallow /revistadigital/wp-admin/
Disallow /revistadigital/wp-includes/
Disallow /revistadigital/wp-content/
Disallow /revistadigital/wp-content/plugins/
Disallow /revistadigital/wp-content/themes/
Disallow /revistadigital/acceder
Disallow */feed/
Disallow *?feed*
Disallow /*/attachment/
Disallow */tag/*/page/
Disallow */tag/*/feed/
Disallow /revistadigital/xmlrpc.php
Disallow /*/xmlrpc.php
Disallow /*/*/xmlrpc.php
Disallow /revistadigital/?attachment_id*
Disallow /*?*
Disallow /core/
Disallow /profiles/
Disallow /akamai/

*

Rule Path
Disallow /revistadigital/trackback
Disallow /revistadigital/*trackback
Disallow /revistadigital/*trackback*
Disallow /revistadigital/*/trackback
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips
Disallow /node/add/
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /index.php/admin/
Disallow /index.php/comment/reply/
Disallow /index.php/filter/tips
Disallow /index.php/node/add/
Disallow /index.php/user/password/
Disallow /index.php/user/register/
Disallow /index.php/user/login/
Disallow /index.php/user/logout/
Disallow /*?id_simo=*&userAgent=*
Disallow *%26userAgent%3D*
Disallow *%26refererURL%3D*
Disallow /articulos_revista_relacionados/posts

*

Rule Path
Allow /revistadigital/feed/$
Disallow /revistadigital/feed/
Disallow /revistadigital/comments/feed/
Disallow /revistadigital/*/feed/$
Disallow /revistadigital/*/feed/rss/$
Disallow /revistadigital/*/trackback/$
Disallow /revistadigital/*/*/feed/$
Disallow /revistadigital/*/*/feed/rss/$
Disallow /revistadigital/*/*/trackback/$
Disallow /revistadigital/*/*/*/feed/$
Disallow /revistadigital/*/*/*/feed/rss/$
Disallow /revistadigital/*/*/*/trackback/$
Disallow *?wordfence*
Disallow /get_opiniones/*
Disallow /territorio-inesem/webinars-y-podcast/
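The Allow and Disallow paths above rely on two pattern features: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the URL path. The sketch below shows one way such rules might be evaluated, assuming the precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). Individual crawlers may behave differently, and the helper names `pattern_to_regex` and `is_allowed` are hypothetical. Only a handful of the rules listed above are included; because all three groups target `*`, RFC 9309 treats them as one combined group.

```python
import re

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the start of the path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_allowed(path: str, rules: list[tuple[str, str]]) -> bool:
    """Assumed precedence: longest matching pattern wins; Allow wins a tie."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind.lower() == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed

# A small subset of the rules listed in the groups above.
RULES = [
    ("Allow", "/core/*.css$"),
    ("Disallow", "/core/"),
    ("Disallow", "/*?*"),
    ("Allow", "/revistadigital/feed/$"),
    ("Disallow", "/revistadigital/feed/"),
]

print(is_allowed("/core/themes/style.css", RULES))     # True: the Allow pattern is longer
print(is_allowed("/core/install.php", RULES))          # False: only /core/ matches
print(is_allowed("/cursos/?page=2", RULES))            # False: query strings are blocked
print(is_allowed("/revistadigital/feed/", RULES))      # True: the anchored Allow is longer
print(is_allowed("/revistadigital/feed/rss/", RULES))  # False: the '$' Allow no longer matches
```

Python's standard `urllib.robotparser` does not interpret `*` or `$` inside paths, which is why this sketch does its own pattern translation.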

Other Records

Field Value
sitemap https://www.inesem.es/articulos_investigacion/sitemap.xml
sitemap https://www.inesem.es/cursos/sitemap.xml
sitemap https://www.inesem.es/paginas/sitemap.xml
sitemap https://www.inesem.es/salidas_profesionales/sitemap.xml
sitemap https://www.inesem.es/mapa-sitio-curso
sitemap https://www.inesem.es/revistadigital/sitemap_index.xml
sitemap https://www.inesem.es/revistadigital/articulos/sitemap.xml
sitemap https://www.inesem.es/revistadigital/paginas/sitemap.xml
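The sitemap records above can be read programmatically. As a minimal sketch, assuming Python 3.8 or later, the standard library's `urllib.robotparser.RobotFileParser` exposes them through `site_maps()`. The `SAMPLE_LINES` list is an illustrative excerpt built from two of the URLs above, not the full file.

```python
from urllib.robotparser import RobotFileParser

# Illustrative excerpt; the real file lists eight sitemap records.
SAMPLE_LINES = [
    "User-agent: *",
    "Disallow: /akamai/",
    "Sitemap: https://www.inesem.es/cursos/sitemap.xml",
    "Sitemap: https://www.inesem.es/revistadigital/sitemap_index.xml",
]

parser = RobotFileParser()
parser.parse(SAMPLE_LINES)      # parse in-memory lines; no network access
sitemaps = parser.site_maps()   # list of URLs, or None if there are none

for url in sitemaps or []:
    print(url)
```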

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • CSS, JS, Images
  • Blocking of dynamic URLs
  • Directories
  • Blocking of trackbacks
  • Files
  • Paths (clean URLs)
  • Disallow: /search/
  • Paths (no clean URLs)
  • Disallow: /index.php/search/
  • Blocking of feeds for crawlers
  • Sitemap
  • Magazine sitemap
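The comments above note that robots.txt is only honored at the root of the host. A minimal sketch of deriving that location from any page URL follows; the helper name `robots_url` is hypothetical.

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url: str) -> str:
    """Return the root-level robots.txt URL for the host serving page_url."""
    parts = urlsplit(page_url)
    # Keep scheme and host; drop the original path, query, and fragment.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))

print(robots_url("http://example.com/site/page.html"))
# http://example.com/robots.txt
print(robots_url("https://www.inesem.es/revistadigital/feed/?x=1"))
# https://www.inesem.es/robots.txt
```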