micromunch.com
robots.txt

Robots Exclusion Standard data for micromunch.com

Resource Scan

Scan Details

Site Domain micromunch.com
Base Domain micromunch.com
Scan Status Ok
Last Scan2025-12-12T19:59:35+00:00
Next Scan 2025-12-19T19:59:35+00:00

Last Scan

Scanned2025-12-12T19:59:35+00:00
URL https://micromunch.com/robots.txt
Domain IPs 104.21.24.47, 172.67.216.211, 2606:4700:3037::6815:182f, 2606:4700:3037::ac43:d8d3
Response IP 172.67.216.211
Found Yes
Hash 80b99eaff939d520a2ca8b0fef4e1e15d090e504bf52e7defb49faeca7099edc
SimHash e40a0a70c4d0

Groups

*

Rule Path
Allow /
Allow /article/
Allow /category/
Disallow /includes/
Disallow /components/includes/
Disallow /api/
Disallow /error_log
Disallow /*.log$
Disallow /__MACOSX/
Disallow /test-*.html
Allow /assets/
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap https://www.micromunch.com/sitemap.xml

Comments

  • robots.txt for micromunch.com
  • Allow Googlebot and other search engines to crawl landing pages
  • Ensure article/category pages are crawlable (ad landing pages)
  • Disallow sensitive/internal areas
  • Allow assets needed for rendering
  • Sitemap location