Robots Exclusion Standard data for cyde.xyz

Resource Scan

Scan Details

Site Domain cyde.xyz
Base Domain cyde.xyz
Scan Status Ok
Last Scan 2025-05-17T08:23:07+00:00
Next Scan 2025-05-24T08:23:07+00:00

Last Scan

Scanned 2025-05-17T08:23:07+00:00
URL https://cyde.xyz/robots.txt
Domain IPs 104.21.40.205, 172.67.188.94, 2606:4700:3032::6815:28cd, 2606:4700:3036::ac43:bc5e
Response IP 104.21.40.205
Found Yes
Hash 5823ac3b1b1c5ac3edd6693e1a098df43e06f8d988cdd0f5efedeb1c4ef40f36
SimHash 69f94a310582

Groups

*

Rule      Path        Comment
Disallow  /wp-admin/  For WordPress admin area
Disallow  /cgi-bin/   Common server scripts
Disallow  /temp/      Temporary files
Disallow  /private/   Private content
Disallow  /search     Search results pages (if applicable)
Disallow  /*.pdf$     Block PDF files
Disallow  /*.zip$     Block ZIP files
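
The rules above mix plain path prefixes with wildcard patterns (`*` for any run of characters, a trailing `$` to anchor the end of the path). As a rough illustration of how a crawler might evaluate them, the following Python sketch translates each pattern into a regular expression. The names `DISALLOW`, `pattern_to_regex` and `is_disallowed` are hypothetical and not part of the scan data, and the wildcard semantics are assumed to follow RFC 9309. Because the group contains no Allow rules, the sketch skips longest-match precedence.

```python
import re

# Disallow patterns copied from the "*" group of the scan.
DISALLOW = [
    "/wp-admin/",
    "/cgi-bin/",
    "/temp/",
    "/private/",
    "/search",
    "/*.pdf$",
    "/*.zip$",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumed semantics (RFC 9309): '*' matches any run of characters,
    a trailing '$' anchors the end of the path, and everything else
    is a literal prefix match from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path (plus query)."""
    # re.match anchors at the start, which gives prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for path in ["/wp-admin/options.php", "/blog/post", "/files/a.pdf",
                 "/files/a.pdf?dl=1", "/search?q=x"]:
        print(path, is_disallowed(path))
```

Under these assumptions, the `$` anchor means `/*.pdf$` would block `/files/a.pdf` but not `/files/a.pdf?dl=1`, while the unanchored `/search` rule would also cover any path that merely begins with `/search`.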

Other Records

Field Value
sitemap https://www.cyde.xyz/sitemap.xml

Comments

  • Allow all search engines
  • Disallow specific folders or files
  • Block indexing of specific file types
  • Sitemap location