smartworld.it
robots.txt

Robots Exclusion Standard data for smartworld.it

Resource Scan

Scan Details

Site Domain smartworld.it
Base Domain smartworld.it
Scan Status Ok
Last Scan 2024-06-15T06:36:55+00:00
Next Scan 2024-06-22T06:36:55+00:00

Last Scan

Scanned 2024-06-15T06:36:55+00:00
URL https://smartworld.it/robots.txt
Redirect https://www.smartworld.it/robots.txt
Redirect Domain www.smartworld.it
Redirect Base smartworld.it
Domain IPs 54.72.109.183, 99.80.61.181
Redirect IPs 104.69.46.165, 2600:1417:3f:ba2::3198, 2600:1417:3f:ba6::3198
Response IP 104.69.46.165
Found Yes
Hash 9b10798e073651a356bca9f79bfb7829f661cda6e03b601ccfd09bc6177cf4a7
SimHash 755cf755c6a7

Groups

mediapartners-google

Rule Path
Disallow (empty path, so nothing is disallowed for this agent)

*

Rule Path
Disallow /wp-admin/
Disallow /cgi-bin/
Disallow /trackback/
Disallow */trackback/
Disallow /2009/
Disallow /2010/
Disallow /2011/
Disallow /2012/

googlebot-image

Rule Path
Allow /*

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

meltwater

Rule Path
Disallow /
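
The groups above can be checked against concrete URLs. What follows is a minimal sketch using Python's standard urllib.robotparser. It makes several assumptions: the robots.txt text is reconstructed from the groups listed in this report and is not the verbatim file, only some of the AI-crawler groups are included, and the two wildcard rules (*/trackback/ and /*) are left out because, to my knowledge, the standard-library parser treats paths as literal prefixes. The agent name "ExampleCrawler" and the test paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report's groups (assumption: not the verbatim file).
# Wildcard rules are omitted; the stdlib parser matches literal prefixes only.
ROBOTS_TXT = """\
User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /wp-admin/
Disallow: /cgi-bin/
Disallow: /trackback/
Disallow: /2009/
Disallow: /2010/
Disallow: /2011/
Disallow: /2012/

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /
"""

BASE = "https://www.smartworld.it"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical agents and paths, chosen only to exercise each group.
checks = [
    ("GPTBot", "/"),
    ("ExampleCrawler", "/wp-admin/"),
    ("ExampleCrawler", "/2024/some-post/"),
    ("Mediapartners-Google", "/wp-admin/"),
]
for agent, path in checks:
    verdict = "allowed" if parser.can_fetch(agent, BASE + path) else "blocked"
    print(f"{agent:22} {path:20} {verdict}")
```

Under these assumptions, GPTBot is blocked everywhere, an unlisted crawler falls back to the * group, and mediapartners-google is allowed everywhere because its Disallow value is empty.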

Other Records

Field Value
sitemap https://www.smartworld.it/sitemap_index.xml
sitemap https://www.smartworld.it/news-sitemap.xml
sitemap https://www.smartworld.it/sitemap/infocommerce_smartworld_1.xml
sitemap https://www.smartworld.it/video-sitemap.xml
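
The sitemap records can be read with the same standard-library parser. This is a small sketch, assuming the four records appear in the file as ordinary "Sitemap:" lines (the exact capitalization in the original file is not shown in this report) and assuming Python 3.8 or later, where RobotFileParser.site_maps() is available.

```python
from urllib.robotparser import RobotFileParser

# The four sitemap records from this report, written as robots.txt lines
# (assumption: the original file's capitalization may differ).
SITEMAP_LINES = """\
Sitemap: https://www.smartworld.it/sitemap_index.xml
Sitemap: https://www.smartworld.it/news-sitemap.xml
Sitemap: https://www.smartworld.it/sitemap/infocommerce_smartworld_1.xml
Sitemap: https://www.smartworld.it/video-sitemap.xml
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_LINES.splitlines())

# site_maps() returns None when no sitemap records are present.
sitemaps = sitemap_parser.site_maps() or []
for url in sitemaps:
    print(url)
```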

Comments

  • enable AdSense
  • disallow all files in these directories
  • allow google image bot to search all images