wokewaves.com
robots.txt

Robots Exclusion Standard data for wokewaves.com

Resource Scan

Scan Details

Site Domain wokewaves.com
Base Domain wokewaves.com
Scan Status Ok
Last Scan2024-11-12T04:59:35+00:00
Next Scan 2024-11-19T04:59:35+00:00

Last Scan

Scanned2024-11-12T04:59:35+00:00
URL https://wokewaves.com/robots.txt
Redirect https://www.wokewaves.com/robots.txt
Redirect Domain www.wokewaves.com
Redirect Base wokewaves.com
Domain IPs 75.2.70.75, 99.83.190.102
Redirect IPs 52.197.0.54, 52.199.221.217, 54.178.223.218
Response IP 52.197.0.54
Found Yes
Hash 4ace4b5d9653476fbd05bfde5dbbd01cde3f68b2db0b63d4d2c90431e39a1059
SimHash 21280332efa8

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /register/
Disallow /private/
Allow /category/
Allow /tag/
Disallow /search
Disallow /*?*
Allow /assets/css/
Allow /assets/js/
Allow /images/
Disallow /config/
Disallow /scripts/
Disallow /backup/

Other Records

Field Value
sitemap https://www.wokewaves.com/sitemap.xml
sitemap https://www.wokewaves.com/sitemap.xml

Comments

  • Block internal admin or backend directories
  • Allow category and tag pages
  • Block internal search results (avoid duplicate content issues)
  • Block query parameters (if applicable)
  • Allow important resources (CSS, JS, images)
  • Block sensitive files (adjust according to your site's structure)
  • Sitemap