mybreedmatch.com
robots.txt

Robots Exclusion Standard data for mybreedmatch.com

Resource Scan

Scan Details

Site Domain mybreedmatch.com
Base Domain mybreedmatch.com
Scan Status Ok
Last Scan 2026-02-26T10:23:25+00:00
Next Scan 2026-03-05T10:23:25+00:00

Last Scan

Scanned 2026-02-26T10:23:25+00:00
URL https://mybreedmatch.com/robots.txt
Domain IPs 216.198.79.1
Response IP 216.198.79.1
Found Yes
Hash a25e8ba9c3d3fcc58439138410ac27d613b59c7396c0e6387156517abcf0bbb5
SimHash 0d35ce10e857
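
The Hash field above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the following is only a sketch that assumes SHA-256 over the raw response body. The helper names (sha256_hex, fetch_robots, unchanged_since_scan) are hypothetical and not part of the scan tool.

```python
import hashlib
from urllib.request import urlopen

# Digest copied from the "Hash" field of the scan.
REPORTED_HASH = "a25e8ba9c3d3fcc58439138410ac27d613b59c7396c0e6387156517abcf0bbb5"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def fetch_robots(url: str = "https://mybreedmatch.com/robots.txt") -> bytes:
    """Download the robots.txt body as raw bytes (performs a network request)."""
    with urlopen(url, timeout=10) as resp:
        return resp.read()


def unchanged_since_scan(body: bytes) -> bool:
    """Compare a body against the reported digest.

    Assumption: the scanner hashed the raw body with SHA-256.
    """
    return sha256_hex(body) == REPORTED_HASH
```

Calling unchanged_since_scan(fetch_robots()) would then indicate whether the live file still matches the scanned copy, provided the assumption about the hash holds. A mismatch may only mean the scanner normalizes the body differently.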

Groups

*

Rule Path
Allow /
Disallow /_next/
Disallow /api/

mediapartners-google

Rule Path
Allow /

Other Records

Field Value
sitemap https://mybreedmatch.com/sitemap.xml

Comments

  • Allow all crawlers
  • Block Next.js internal files (optional but good practice)
  • Explicitly allow the AdSense bot
  • Sitemap location (Vercel generates this automatically usually, but good to have)