sprea.it
robots.txt

Robots Exclusion Standard data for sprea.it

Resource Scan

Scan Details

Site Domain sprea.it
Base Domain sprea.it
Scan Status Ok
Last Scan2024-10-18T21:53:08+00:00
Next Scan 2024-11-17T21:53:08+00:00

Last Scan

Scanned2024-10-18T21:53:08+00:00
URL https://sprea.it/robots.txt
Domain IPs 136.243.174.217
Response IP 136.243.174.217
Found Yes
Hash 4148a2713ee0511e55222531e56bcaf592d103caaea20a053b608493d1eac264
SimHash 78279405ef3e

Groups

*

Rule Path
Disallow /coupon/
Disallow /materiali
Disallow /sprea.xml

Other Records

Field Value
sitemap https://sprea.it/sitemap.xml
sitemap https://sprea.it/sitemap-tipi-riviste.xml
sitemap https://sprea.it/sitemap-abbonamento.xml
sitemap https://sprea.it/sitemap-arretrati.xml

Comments

  • Non fa indicizzare le pagine coi coupon sconto
  • Non fa indicizzare le cartella public che duplica il sito (wtf)
  • Disallow: /public
  • Non fa indicizzare il contenuto di /materiali