smartworld.it
robots.txt

Robots Exclusion Standard data for smartworld.it

Resource Scan

Scan Details

Site Domain smartworld.it
Base Domain smartworld.it
Scan Status Ok
Last Scan 2024-11-09T13:42:52+00:00
Next Scan 2024-11-16T13:42:52+00:00

Last Scan

Scanned 2024-11-09T13:42:52+00:00
URL https://smartworld.it/robots.txt
Redirect https://www.smartworld.it/robots.txt
Redirect Domain www.smartworld.it
Redirect Base smartworld.it
Domain IPs 34.246.157.188, 52.48.75.183
Redirect IPs 23.50.90.17, 2600:1413:b000:780::3198, 2600:1413:b000:79f::3198
Response IP 104.69.46.165
Found Yes
Hash aff8bd48383b646cbd0df66ef3fd9cf874777a054e4553dd2e4e24c57abf6c04
SimHash 7555f355c7a7

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /wp-admin/
Disallow /cgi-bin/
Disallow /trackback/
Disallow */trackback/
Disallow /confronta-schede/
Disallow /2009/
Disallow /2010/
Disallow /2011/
Disallow /2012/

googlebot-image

Rule Path
Allow /*

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.smartworld.it/sitemap_index.xml
sitemap https://www.smartworld.it/news-sitemap.xml
sitemap https://www.smartworld.it/sitemap/infocommerce_smartworld_1.xml
sitemap https://www.smartworld.it/video-sitemap.xml
sitemap https://www.smartworld.it/contributors-sitemap.xml

Comments

  • enable AdSense
  • disallow all files in these directories
  • allow google image bot to search all images
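To make the groups and sitemap records above easier to reason about, here is a minimal sketch that feeds a reconstructed copy of the rules to Python's standard `urllib.robotparser` and asks whether a given crawler may fetch a given path. The `ROBOTS_TXT` text is an assumption rebuilt from this report, not the original file (its casing, ordering, and comments may differ), and the `allowed` helper, the `BASE` constant, and the `examplebot` agent name are hypothetical names introduced only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned file, rebuilt from the groups and
# sitemap records in this report. The real file may differ in casing,
# ordering, and comments.
ROBOTS_TXT = """\
User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /wp-admin/
Disallow: /cgi-bin/
Disallow: /trackback/
Disallow: */trackback/
Disallow: /confronta-schede/
Disallow: /2009/
Disallow: /2010/
Disallow: /2011/
Disallow: /2012/

User-agent: googlebot-image
Allow: /*

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /

User-agent: claude-web
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: cohere-ai
Disallow: /

User-agent: perplexitybot
Disallow: /

User-agent: seekr
Disallow: /

User-agent: meltwater
Disallow: /

Sitemap: https://www.smartworld.it/sitemap_index.xml
Sitemap: https://www.smartworld.it/news-sitemap.xml
Sitemap: https://www.smartworld.it/sitemap/infocommerce_smartworld_1.xml
Sitemap: https://www.smartworld.it/video-sitemap.xml
Sitemap: https://www.smartworld.it/contributors-sitemap.xml
"""

# Hypothetical base URL, taken from the redirect target in the scan details.
BASE = "https://www.smartworld.it"

# Parse the text directly instead of fetching it over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    checks = [
        ("gptbot", "/"),                        # AI crawler group: Disallow /
        ("mediapartners-google", "/wp-admin/"), # empty Disallow: nothing blocked
        ("examplebot", "/wp-admin/"),           # falls back to the * group
        ("examplebot", "/news/"),               # no matching rule in the * group
    ]
    for agent, path in checks:
        print(f"{agent:22} {path:12} allowed={allowed(agent, path)}")
    # site_maps() is available in Python 3.8 and later.
    print(parser.site_maps())
```

Note that `urllib.robotparser` matches rule paths by prefix and, as far as I know, does not expand `*` wildcards inside paths, so entries such as `*/trackback/` and `/*` are not interpreted the way a wildcard-aware crawler would interpret them. Treat the results for those two rules as approximate.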