campa-cola.in
robots.txt

Robots Exclusion Standard data for campa-cola.in

Resource Scan

Scan Details

Site Domain campa-cola.in
Base Domain campa-cola.in
Scan Status Ok
Last Scan2024-06-30T12:23:55+00:00
Next Scan 2024-07-07T12:23:55+00:00

Last Scan

Scanned2024-06-30T12:23:55+00:00
URL https://campa-cola.in/robots.txt
Domain IPs 13.226.2.44, 13.226.2.46, 13.226.2.72, 13.226.2.92, 2600:9000:21f8:1e00:1b:b182:140:93a1, 2600:9000:21f8:6600:1b:b182:140:93a1, 2600:9000:21f8:7e00:1b:b182:140:93a1, 2600:9000:21f8:800:1b:b182:140:93a1, 2600:9000:21f8:9000:1b:b182:140:93a1, 2600:9000:21f8:9c00:1b:b182:140:93a1, 2600:9000:21f8:b400:1b:b182:140:93a1, 2600:9000:21f8:e00:1b:b182:140:93a1
Response IP 18.165.171.92
Found Yes
Hash 9bb8594011ca1223c3b64dba9781c7bd33f1f40d627fec38245899aad1c1e335
SimHash 74245f5167a5

Groups

*

Rule Path
Disallow /bat/
Disallow /sitempa-full.xml
Allow /*.txt$
Allow /*.html$
Allow /controls/
Allow /controls/*.html$
Allow /css/
Allow /fonts/
Allow /images/
Allow /js/
Allow /product/
Allow /product/*.html$

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://campa-cola.in/sitemap.xml

Comments

  • Disallow access to certain directories and files
  • Block bots from crawling certain URLs
  • Allow access to specific files or directories
  • Set the crawl delay for all bots
  • Set the Sitemap directive to specify the location of your sitemap