deskaeronautico.it
robots.txt

Robots Exclusion Standard data for deskaeronautico.it

Resource Scan

Scan Details

Site Domain deskaeronautico.it
Base Domain deskaeronautico.it
Scan Status Ok
Last Scan 2025-04-18T07:25:33+00:00
Next Scan 2025-04-25T07:25:33+00:00

Last Scan

Scanned 2025-04-18T07:25:33+00:00
URL https://deskaeronautico.it/robots.txt
Redirect https://www.deskaeronautico.it/robots.txt
Redirect Domain www.deskaeronautico.it
Redirect Base deskaeronautico.it
Domain IPs 46.252.155.42
Redirect IPs 104.21.42.54, 172.67.157.32, 2606:4700:3031::ac43:9d20, 2606:4700:3037::6815:2a36
Response IP 104.21.42.54
Found Yes
Hash d1245abf91d257d97516fc7890765930ae4b995a1f32d11161c1896da8e98971
SimHash 42265ad2a383

Groups

claudebot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

buck

Rule Path
Disallow /

yandex

Rule Path
Disallow /

applebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /
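
The thirteen named groups above all carry the single rule `Disallow /`. The following is a minimal sketch of how such groups could be checked with Python's standard `urllib.robotparser`. The robots.txt lines are reconstructed from the group names in this scan rather than copied from the live file, and the helper names `build_robots_lines` and `may_fetch` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens of the groups listed in the scan, each with "Disallow /".
BLOCKED_AGENTS = [
    "claudebot", "barkrowler", "oai-searchbot", "chatgpt-user", "gptbot",
    "ahrefsbot", "zoominfobot", "buck", "yandex", "applebot", "dotbot",
    "meta-externalagent", "amazonbot",
]


def build_robots_lines(agents):
    """Rebuild robots.txt lines for the named groups (an assumption:
    the original file's exact layout is not shown in the scan)."""
    lines = []
    for agent in agents:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    return lines


parser = RobotFileParser()
parser.parse(build_robots_lines(BLOCKED_AGENTS))


def may_fetch(agent, url="https://www.deskaeronautico.it/"):
    """Return True if `agent` may fetch `url` under the rebuilt rules."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("GPTBot", "ClaudeBot", "Googlebot"):
        print(agent, may_fetch(agent))
```

This sketch deliberately leaves out the `*` group, so an agent that matches none of the named groups is reported as allowed. Note also that `urllib.robotparser` matches agent names case-insensitively and by substring, which may differ from how a given crawler interprets the file.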

*

Rule Path
Disallow /no-index/
Disallow /daos/
Disallow /wiki/
Disallow /staging/
Disallow /demo-pdv/
Disallow /wp-content/
Disallow /wp-admin/
Disallow /sigmet/
Disallow /airmet/
Disallow /sigwx/
Disallow /swll/
Disallow /statistiche/
Disallow /wp-includes/
Disallow /mappa/data/
Disallow *.css
Disallow *.js
Disallow *.geojson
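
As a rough illustration of how the path rules in the `*` group could be evaluated, the Python sketch below turns each pattern into a regular expression using the wildcard conventions described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and matching starts at the beginning of the path). The function names and sample paths are hypothetical, and real crawlers may handle patterns that do not begin with `/` (such as `*.css`) differently or ignore them altogether.

```python
import re

# Disallow patterns of the `*` group, copied from the scan above.
DISALLOW = [
    "/no-index/", "/daos/", "/wiki/", "/staging/", "/demo-pdv/",
    "/wp-content/", "/wp-admin/", "/sigmet/", "/airmet/", "/sigwx/",
    "/swll/", "/statistiche/", "/wp-includes/", "/mappa/data/",
    "*.css", "*.js", "*.geojson",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if any Disallow pattern matches the start of `path`."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for sample in ("/wiki/page", "/blog/post", "/theme/style.css"):
        print(sample, is_disallowed(sample))
```

Since this group contains only `Disallow` rules, any match means the path is blocked, and no longest-match comparison against `Allow` rules is needed. Under this prefix-style interpretation, `*.js` would also match a path such as `/data/x.json`, because `.js` occurs inside `.json` and the pattern has no `$` anchor.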

Other Records

Field Value
crawl-delay 30
sitemap https://www.deskaeronautico.it/sitemap_index.xml

Comments

  • Simple Robots.txt 0.1

Warnings

  • `noindex` is not a known field.