eduyug.com
robots.txt

Robots Exclusion Standard data for eduyug.com

Resource Scan

Scan Details

Site Domain eduyug.com
Base Domain eduyug.com
Scan Status Ok
Last Scan2025-12-21T16:45:45+00:00
Next Scan 2026-01-20T16:45:45+00:00

Last Scan

Scanned2025-12-21T16:45:45+00:00
URL https://eduyug.com/robots.txt
Domain IPs 104.21.54.33, 172.67.223.27, 2606:4700:3031::6815:3621, 2606:4700:3033::ac43:df1b
Response IP 172.67.223.27
Found Yes
Hash 0ff5bc4e0408914261b2081dfdcdb03fd888cc8b7572e267febb09f46681ba1b
SimHash f51e8e5146b4

Groups

*

Rule Path
Allow /
Allow /assets/
Allow /assest/
Disallow /application/
Disallow /system/
Disallow /index.php/
Disallow /admin/
Disallow /private/
Disallow /temp/
Disallow /cache/
Disallow /logs/
Disallow /*?*
Disallow /*%26*
Allow /sitemap.xml
Allow /favicon.ico
Allow /robots.txt

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://eduyug.com/sitemap.xml

Comments

  • Robots.txt for EduYug.com
  • School ERP Software Website
  • Allow all search engines to crawl the site
  • Allow access to main directories
  • Disallow access to sensitive directories
  • Disallow query parameters that don't add value
  • Allow specific important files
  • Block specific bots if needed (uncomment if required)
  • User-agent: AhrefsBot
  • Disallow: /
  • User-agent: MJ12bot
  • Disallow: /
  • Crawl delay for all bots (optional - helps with server load)
  • Sitemap location
  • Additional sitemaps (if you create more specific ones)
  • Sitemap: https://eduyug.com/sitemap-images.xml
  • Sitemap: https://eduyug.com/sitemap-blog.xml