health2022.wired.it
robots.txt

Robots Exclusion Standard data for health2022.wired.it

Resource Scan

Scan Details

Site Domain health2022.wired.it
Base Domain wired.it
Scan Status Ok
Last Scan 2024-04-25T14:18:42+00:00
Next Scan 2024-05-25T14:18:42+00:00

Last Scan

Scanned 2024-04-25T14:18:42+00:00
URL https://health2022.wired.it/robots.txt
Domain IPs 13.33.21.106, 13.33.21.13, 13.33.21.46, 13.33.21.70
Response IP 18.165.171.110
Found Yes
Hash d7bb9b7b9410792afc1e2144d13fb9a492bf192697f6aee7a500a4bd34c3ef3e
SimHash 88404a66273d
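
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched robots.txt body. The report does not name the algorithm, so treating it as SHA-256 is an assumption. Under that assumption, a minimal sketch for fingerprinting a fetched body could look like this (body_digest is a hypothetical helper name, not part of the scanner):

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report only shows a 64-hex-digit value and does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()
```

Comparing the digest of a fresh download with the stored value would show whether the file changed between scans, provided the assumption about the algorithm and the hashed bytes holds.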

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/

*

Rule Path
Disallow /search?q=
Disallow /esi/
Disallow /wp-admin/
Disallow */WP_HOME/*
Disallow */cartella/*

ia_archiver-web.archive.org

Rule Path
Allow /
Disallow /trackback/
Disallow /feed/
Disallow /comments/
Disallow */trackback/
Disallow */feed/
Disallow */comments/

grapeshot

Rule Path
Disallow
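
The groups above combine Allow and Disallow rules, some with "*" wildcards. As a rough illustration of how a crawler following the longest-match convention of RFC 9309 might evaluate the two "*" groups, here is a minimal sketch. The RULES list is copied from the tables above; pattern_to_regex and is_allowed are hypothetical helper names, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Rules copied from the two "*" groups listed in the report.
RULES = [
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/wp-content/uploads/"),
    ("disallow", "/search?q="),
    ("disallow", "/esi/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "*/WP_HOME/*"),
    ("disallow", "*/cartella/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; on a tie, allow wins.

    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

With these rules, is_allowed("/wp-admin/admin-ajax.php") is True because the Allow pattern is longer than "/wp-admin/", while is_allowed("/wp-admin/options.php") is False. The grapeshot group's empty Disallow blocks nothing, so it needs no entry in such a list.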

Other Records

Field Value
sitemap https://www.wired.it/sitemap_index.xml
sitemap https://next.wired.it/sitemap_index.xml
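
The sitemap records above can be read back out of a robots.txt file with Python's standard urllib.robotparser module (site_maps() requires Python 3.8 or later). The sketch below parses an in-memory excerpt rather than fetching the live file; ROBOTS_LINES is an illustrative reconstruction based on this report, not the verbatim file, and list_sitemaps is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Illustrative excerpt reconstructed from the report, not the verbatim file.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /esi/",
    "Sitemap: https://www.wired.it/sitemap_index.xml",
    "Sitemap: https://next.wired.it/sitemap_index.xml",
]


def list_sitemaps(lines):
    """Return the Sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap lines are present.
    return parser.site_maps() or []
```

Both sitemaps point at other wired.it hosts (www and next) rather than at health2022.wired.it itself.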

Comments

  • --------------------------------------
  • Articles with /WP_HOME/
  • --------------------------------------
  • --------------------------------------
  • Category pagination
  • --------------------------------------
  • Disallow: */pag/*
  • --------------------------------------
  • Topic pagination
  • --------------------------------------
  • Disallow: /topic/*/pag/*
  • --------------------------------------
  • Filters on category pages
  • --------------------------------------