swimbikerun.ph
robots.txt

Robots Exclusion Standard data for swimbikerun.ph

Resource Scan

Scan Details

Site Domain swimbikerun.ph
Base Domain swimbikerun.ph
Scan Status Ok
Last Scan2025-09-28T07:37:37+00:00
Next Scan 2025-10-05T07:37:37+00:00

Last Scan

Scanned2025-09-28T07:37:37+00:00
URL https://swimbikerun.ph/robots.txt
Domain IPs 104.21.35.220, 172.67.180.59, 2606:4700:3031::ac43:b43b, 2606:4700:3036::6815:23dc
Response IP 172.67.180.59
Found Yes
Hash 70193f993bf812a88ac4f29a837cb627b73373e3a8c754d56e10ca4a1f9f8a7d
SimHash 1626114b26ea

Groups

googlebot

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/

bingbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /calendar/
Allow /

facebookexternalhit

Rule Path
Allow /

twitterbot

Rule Path
Allow /

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /xmlrpc.php
Disallow /trackback/
Disallow /search/
Disallow /feed/
Disallow /login/
Disallow /signup/
Disallow /author/
Disallow /private/
Disallow /tmp/

*

Rule Path
Disallow /*.php$
Disallow /*.cgi$
Disallow /*.txt$
Disallow /*.log$
Disallow /*.zip$
Disallow /*.sql$
Disallow /*.json$
Disallow /*.inc$
Disallow /*.asp$
Disallow /*.htm$
Disallow /*.shtml$
Disallow /*.pl$
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://swimbikerun.ph/sitemap_index.xml
sitemap https://swimbikerun.ph/sitemap_index.xml

Comments

  • SwimBikeRun.ph - Ultimate SEO & Performance Optimized Robots.txt
  • Version: Gold Standard | Last Updated: March 29, 2025
  • ✅ Allow Googlebot to index core site content
  • ❌ Block BingBot completely (Too resource-intensive, not valuable)
  • ❌ Block known aggressive crawlers to preserve bandwidth and server resources
  • 🚫 Control specific bot activity to prevent excessive dynamic page hits
  • 🔗 Allow social media crawlers for proper sharing previews
  • 🔒 Block backend WordPress directories, but allow needed frontend assets
  • 🛡 Block indexing of sensitive or irrelevant file types
  • 🎯 Allow media, uploads, and AJAX functionality
  • 📍 Sitemap for structured crawling