centroaperture.it
robots.txt

Robots Exclusion Standard data for centroaperture.it

Resource Scan

Scan Details

Site Domain centroaperture.it
Base Domain centroaperture.it
Scan Status Ok
Last Scan2024-11-04T10:13:22+00:00
Next Scan 2024-11-11T10:13:22+00:00

Last Scan

Scanned2024-11-04T10:13:22+00:00
URL https://centroaperture.it/robots.txt
Redirect https://www.centroaperture.it/robots.txt
Redirect Domain www.centroaperture.it
Redirect Base centroaperture.it
Domain IPs 136.243.173.164, 2a01:4f8:171:22a3::2
Redirect IPs 136.243.173.164, 2a01:4f8:171:22a3::2
Response IP 136.243.173.164
Found Yes
Hash 2b030e0a2f5f33e51f8189119bd71f61be2fe437713e239350b66548db4e2c12
SimHash ac129d0ac564

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /ricerca
Disallow /ricerca?*
Disallow /ricerca*

Other Records

Field Value
crawl-delay 10

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • For syntax checking, see:
  • http://www.sxw.org.uk/computing/robots/check.html
  • For Adsense