realdemadrid.com
robots.txt

Robots Exclusion Standard data for realdemadrid.com

Resource Scan

Scan Details

Site Domain realdemadrid.com
Base Domain realdemadrid.com
Scan Status Ok
Last Scan2025-05-13T06:13:34+00:00
Next Scan 2025-05-20T06:13:34+00:00

Last Scan

Scanned2025-05-13T06:13:34+00:00
URL https://www.realdemadrid.com/robots.txt
Domain IPs 104.26.4.251, 104.26.5.251, 172.67.72.38, 2606:4700:20::681a:4fb, 2606:4700:20::681a:5fb, 2606:4700:20::ac43:4826
Response IP 104.26.5.251
Found Yes
Hash 26829f9cc41f7f12bf5754b46367d952e2f31806803e3542b4446ffa35b2830c
SimHash a324d855242b

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Allow /wp-includes/*.js
Allow /wp-includes/*.css
Allow /wp-includes/blocks/navigation/view.min.js
Allow /wp-admin/admin-ajax.php
Disallow /tag/
Disallow /author/
Disallow /?s=
Disallow /?attachment_id=
Disallow /*/feed/
Disallow /*/trackback/
Disallow /feed/
Disallow /comments/feed/
Disallow /page/
Disallow /?replytocom=

Other Records

Field Value
sitemap https://www.realdemadrid.com/sitemap_index.xml
sitemap https://www.realdemadrid.com/news-sitemap.xml

Comments

  • Allow Googlebot to crawl everything important, block unnecessary pages
  • Block WordPress core directories except specific files needed for rendering
  • Allow specific access to critical JavaScript and CSS in /wp-includes
  • Allow access to admin-ajax.php for functionality
  • Optional: Uncomment if you want to allow access to JSON API
  • Allow: /wp-json/
  • Block pages that don’t need indexing
  • Block common WordPress pages
  • Sitemaps