dclouds.in
robots.txt

Robots Exclusion Standard data for dclouds.in

Resource Scan

Scan Details

Site Domain dclouds.in
Base Domain dclouds.in
Scan Status Ok
Last Scan2025-11-03T22:14:12+00:00
Next Scan 2025-11-10T22:14:12+00:00

Last Scan

Scanned2025-11-03T22:14:12+00:00
URL https://dclouds.in/robots.txt
Domain IPs 103.106.229.82, 149.28.136.245, 15.235.181.227, 45.32.123.201
Response IP 45.32.123.201
Found Yes
Hash 5f63fe7d9911cb4008506c6913cd2c697035180272d429ce5dae9e468b3e2383
SimHash 00040b73eea7

Groups

*

Rule Path
Disallow /admin/
Disallow /private/
Disallow /tmp/
Disallow /cart/
Disallow /checkout/
Disallow /user-profile/
Disallow /config/
Disallow /scripts/
Disallow /backup/
Disallow /logs/
Disallow /howdy/
Disallow /*.json$
Disallow /*.csv$
Disallow /*.zip$
Disallow /*.tar$
Disallow /*.gz$

googlebot

Rule Path
Allow /
Allow /images/
Allow /css/
Allow /js/

Other Records

Field Value
sitemap https://dclouds.in/sitemap_index.xml

Comments

  • Global settings for all bots
  • Disallow access to sensitive directories
  • Prevent access to files that are typically unnecessary for search engines
  • Allow Googlebot to crawl everything important
  • Allow specific assets that enhance page performance and user experience
  • Sitemap to guide search engines to important pages