imtsinstitute.com
robots.txt

Robots Exclusion Standard data for imtsinstitute.com

Resource Scan

Scan Details

Site Domain imtsinstitute.com
Base Domain imtsinstitute.com
Scan Status Ok
Last Scan2026-01-12T09:59:01+00:00
Next Scan 2026-02-11T09:59:01+00:00

Last Scan

Scanned2026-01-12T09:59:01+00:00
URL https://imtsinstitute.com/robots.txt
Domain IPs 216.150.1.1
Response IP 216.150.1.1
Found Yes
Hash a0500b10ada12fbe8adaceed30f4bb63a0c4ede105f18e69f30cf04be8d41d61
SimHash 990d5b102fe8

Groups

*

Rule Path
Allow /
Allow /edu
Allow /courses
Allow /university
Allow /authors
Allow /category
Allow /search
Allow /student-reviews
Disallow /api/
Disallow /admin/
Disallow /dashboard/
Disallow /auth/
Disallow /_next/
Disallow /*.json$
Disallow /*?*utm*
Disallow /*?*ref*
Disallow /*?*fbclid*
Disallow /404
Disallow /500
Allow /robots.txt
Allow /sitemap*.xml
Allow /feed.xml
Allow /manifest.json
Allow /favicon.ico

Other Records

Field Value
sitemap https://imtsinstitute.com/sitemap-index.xml
sitemap https://imtsinstitute.com/sitemap-edu.xml
sitemap https://imtsinstitute.com/sitemap-courses.xml
sitemap https://imtsinstitute.com/student-reviews/sitemap.xml
sitemap https://imtsinstitute.com/feed.xml

Comments

  • IMTS Institute - Robots.txt
  • Optimized for modern Google crawling and daily content indexing
  • Prioritize education content for crawlers
  • Block non-essential areas
  • Allow essential discovery files
  • Modern sitemap structure for daily content discovery
  • Optimized for daily content updates - no crawl delay for faster indexing