cloudsea35.com
robots.txt

Robots Exclusion Standard data for cloudsea35.com

Resource Scan

Scan Details

Site Domain cloudsea35.com
Base Domain cloudsea35.com
Scan Status Ok
Last Scan2026-03-18T17:32:00+00:00
Next Scan 2026-03-25T17:32:00+00:00

Last Scan

Scanned2026-03-18T17:32:00+00:00
URL https://cloudsea35.com/robots.txt
Domain IPs 104.21.96.60, 172.67.173.154, 2606:4700:3036::6815:603c, 2606:4700:3037::ac43:ad9a
Response IP 172.67.173.154
Found Yes
Hash 2a6d45b1a0964da5f5b1c51bee8177a56bac2c207e8247f0f75ff971f373eaea
SimHash 2f14580ae527

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /feed/
Disallow /comments/
Disallow /trackback/
Disallow /tag/
Allow /wp-admin/admin-ajax.php

googlebot-image

Rule Path
Disallow

Other Records

Field Value
sitemap https://cloudsea35.com/sitemap.xml
sitemap https://cloudsea35.com/news-sitemap.xml
sitemap https://cloudsea35.com/sitemap_index.xml

Comments

  • CloudSea35.com robots.txt file
  • Generated on 2025-09-03
  • This file tells web crawlers which pages they can or cannot crawl.
  • A. GENERAL RULES FOR ALL CRAWLERS
  • Disallow access to common WordPress administrative and system directories.
  • Allow access to the main administrative-ajax.php file for site functionality.
  • B. SPECIFIC RULES FOR GOOGLEBOT-IMAGE
  • Allow all images to be indexed for Google Images.
  • C. SITEMAP LOCATION
  • This is crucial for guiding search engines to all your important content.