latestata.it
robots.txt

Robots Exclusion Standard data for latestata.it

Resource Scan

Scan Details

Site Domain latestata.it
Base Domain latestata.it
Scan Status Ok
Last Scan2025-11-14T21:59:23+00:00
Next Scan 2025-11-21T21:59:23+00:00

Last Scan

Scanned2025-11-14T21:59:23+00:00
URL https://latestata.it/robots.txt
Redirect https://www.latestata.it/robots.txt
Redirect Domain www.latestata.it
Redirect Base latestata.it
Domain IPs 104.26.0.171, 104.26.1.171, 172.67.70.55, 2606:4700:20::681a:1ab, 2606:4700:20::681a:ab, 2606:4700:20::ac43:4637
Redirect IPs 104.26.0.171, 104.26.1.171, 172.67.70.55, 2606:4700:20::681a:1ab, 2606:4700:20::681a:ab, 2606:4700:20::ac43:4637
Response IP 104.26.1.171
Found Yes
Hash d08d1cc41410d634738fb03b2608703c6305ea657d87bab7cd4404808324e67b
SimHash 4a305b53e2db

Groups

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

yandex

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

whatsapp

Rule Path
Allow /

telegrambot

Rule Path
Allow /

ccbot

Rule Path
Allow /

pinterest

Rule Path
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

brightbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

applebot

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

scoop.it

Rule Path
Disallow /

seekr

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /xmlrpc.php
Disallow /cgi-bin/
Disallow /trackback/
Disallow /comments/
Disallow /*?s=
Allow /feed/
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /wp-content/plugins/

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.latestata.it/sitemap_index.xml

Comments

  • ✅ MOTORI DI RICERCA (accesso completo)
  • ✅ BOT DEI SOCIAL (anteprime e condivisioni)
  • ✅ BOT DI MONITORAGGIO E SEO POSITIVI
  • ❌ BLOCCO BOT AI E COMMERCIALI INVASIVI
  • ⚙️ IMPOSTAZIONI STANDARD WORDPRESS
  • 🗺️ SITEMAP
  • 🕒 OPZIONALE

Warnings

  • `host` is not a known field.