icamilli.com
robots.txt

Robots Exclusion Standard data for icamilli.com

Resource Scan

Scan Details

Site Domain icamilli.com
Base Domain icamilli.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-20T13:05:30+00:00
Next Scan 2025-10-04T13:05:30+00:00

Last Successful Scan

Scanned2025-09-05T13:01:21+00:00
URL https://icamilli.com/robots.txt
Domain IPs 139.28.17.138
Response IP 139.28.17.138
Found Yes
Hash 46dbf4b891d9c1a225f4bb6bcfcf042c3c927e0de7f8530f582040094209094b
SimHash aa069d4bc574

Groups

*

Rule Path
Disallow
Allow /ads.txt

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://www.icamilli.com/sitemap.xml

Comments

  • robots.txt automaticaly generated by icamilli.com solution
  • https://www.icamilli.com
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/wc/robots.html
  • All Robots
  • Allow Adsense
  • Sitemap