salerno.corriere.it
robots.txt

Robots Exclusion Standard data for salerno.corriere.it

Resource Scan

Scan Details

Site Domain salerno.corriere.it
Base Domain corriere.it
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-01-13T10:01:40+00:00
Next Scan 2025-01-14T10:01:40+00:00

Last Successful Scan

Scanned 2025-01-06T10:01:12+00:00
URL https://salerno.corriere.it/robots.txt
Domain IPs 199.232.193.50, 199.232.197.50
Response IP 146.75.41.50
Found Yes
Hash 87af5eb13133c322b03f92eb52afbd30a082005a06f4988f2710bb9e6bbf6b13
SimHash f20749d08455
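
A fetched copy of the file can be compared against the Hash value above. The report does not name the algorithm; the 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name `body_matches_report` is hypothetical.

```python
import hashlib

# Hash value copied from the "Last Successful Scan" listing above.
REPORTED_HASH = "87af5eb13133c322b03f92eb52afbd30a082005a06f4988f2710bb9e6bbf6b13"


def body_matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Return True if the SHA-256 digest of `body` equals `reported`.

    Assumption: the report hashes the raw bytes of the robots.txt response
    with SHA-256. If the scanner normalizes the body first, this will differ.
    """
    return hashlib.sha256(body).hexdigest() == reported.lower()
```

A mismatch would only indicate that the file changed since the scan or that the assumption about the algorithm is wrong.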

Groups

*

Rule Path
Disallow */localwebapp
Disallow /cronistipercaso/loadArg.do*
Disallow /ricerca/
Disallow /*_print.html$
Disallow /ultima_ora/
Disallow /notizie-ultima-ora/
Disallow /communityLocal/
Disallow /_template/
Disallow /apw.js?
Disallow /*app_v2
Disallow /*app_v1

petalbot

Rule Path
Disallow /

yandex

Rule Path Comment
Disallow / prohibits crawling for the entire site

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
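
The `*` group above relies on wildcard paths (`*` for any run of characters, a trailing `$` for end of path). A minimal sketch of how such patterns could be tested against a URL path follows, assuming the wildcard semantics described in RFC 9309. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and Python's standard `urllib.robotparser` is not used here because it does not interpret these wildcards.

```python
import re

# Disallow patterns copied from the "*" group of the report above.
DISALLOW_ALL_AGENTS = [
    "*/localwebapp",
    "/cronistipercaso/loadArg.do*",
    "/ricerca/",
    "/*_print.html$",
    "/ultima_ora/",
    "/notizie-ultima-ora/",
    "/communityLocal/",
    "/_template/",
    "/apw.js?",
    "/*app_v2",
    "/*app_v1",
]


def pattern_to_regex(pattern):
    """Translate one robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the path;
    every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path, patterns=DISALLOW_ALL_AGENTS):
    """Return True if any pattern matches from the start of `path`."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

For example, `is_disallowed("/ricerca/salerno")` is true, while `is_disallowed("/cronaca/articolo.html")` is false. The group contains no Allow rules, so the sketch skips the longest-match precedence between Allow and Disallow.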

Other Records

Field Value
sitemap https://www.corriere.it/dynamic-sitemap/sitemap-last-100/Salerno.xml

Comments

  • ACAP version=1.0

Warnings

  • 1 invalid line.
  • `acap-crawler` is not a known field.
  • `acap-disallow-crawl` is not a known field.