chiamarsibomber.com
robots.txt

Robots Exclusion Standard data for chiamarsibomber.com

Resource Scan

Scan Details

Site Domain chiamarsibomber.com
Base Domain chiamarsibomber.com
Scan Status Ok
Last Scan2024-11-16T03:35:38+00:00
Next Scan 2024-11-23T03:35:38+00:00

Last Scan

Scanned2024-11-16T03:35:38+00:00
URL https://chiamarsibomber.com/robots.txt
Redirect https://www.chiamarsibomber.com/robots.txt
Redirect Domain www.chiamarsibomber.com
Redirect Base chiamarsibomber.com
Domain IPs 76.76.21.21
Redirect IPs 76.76.21.164, 76.76.21.93
Response IP 76.76.21.98
Found Yes
Hash c3369ca5e0632cdd064c205fddec2deba3cc2f813fcc39cb6900cfde721a00fe
SimHash 60271f5185d7

Groups

*

Rule Path
Disallow /profile/
Allow /css/
Allow /js/

Other Records

Field Value
sitemap https://www.chiamarsibomber.com/sitemap-base.xml
sitemap https://www.chiamarsibomber.com/sitemap-news.xml
sitemap https://www.chiamarsibomber.com/sitemap-footballers.xml
sitemap https://www.chiamarsibomber.com/news/sitemap.xml
sitemap https://www.chiamarsibomber.com/news/redazione/sitemap.xml

Comments

  • This is a sample robots.txt file for Googlebot
  • It provides instructions to Google's web crawler about which parts of the site should be crawled or not
  • Allow Googlebot to crawl all parts of the site
  • Disallow specific folders and pages for Googlebot
  • Allow Googlebot to access CSS and JavaScript files
  • Sitemap location for Googlebot