cucinare.tv
robots.txt

Robots Exclusion Standard data for cucinare.tv

Resource Scan

Scan Details

Site Domain cucinare.tv
Base Domain cucinare.tv
Scan Status Ok
Last Scan2024-11-09T19:43:04+00:00
Next Scan 2024-11-16T19:43:04+00:00

Last Scan

Scanned2024-11-09T19:43:04+00:00
URL https://cucinare.tv/robots.txt
Redirect https://www.cucinare.tv/robots.txt
Redirect Domain www.cucinare.tv
Redirect Base cucinare.tv
Domain IPs 104.17.115.20, 104.17.116.20
Redirect IPs 104.17.115.20, 104.17.116.20
Response IP 104.17.115.20
Found Yes
Hash f37e5b7e0422c8fcffee0984521c9b12573e027d2c8bbdc4f84fdf03063f04f6
SimHash aa909d19c764

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/*

Other Records

Field Value
crawl-delay 10

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap https://www.cucinare.tv/sitemap_index.xml

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • For syntax checking, see:
  • http://www.frobee.com/robots-txt-check