intu.io
robots.txt

Robots Exclusion Standard data for intu.io

Resource Scan

Scan Details

Site Domain intu.io
Base Domain intu.io
Scan Status Ok
Last Scan2025-12-12T12:30:17+00:00
Next Scan 2026-01-11T12:30:17+00:00

Last Scan

Scanned2025-12-12T12:30:17+00:00
URL https://intu.io/robots.txt
Domain IPs 185.30.32.179
Response IP 185.30.32.179
Found Yes
Hash b79d03c4199a689ecd2132d39a7a0c529ed22c4d4f3049d66ea8a01d27081f7b
SimHash 501d8e55d3f5

Groups

*

Rule Path
Disallow /build.txt
Disallow /blog/cgi-bin
Disallow /blog/wp-admin
Disallow /blog/wp-includes
Disallow /blog/wp-content/plugins
Disallow /blog/wp-content/cache
Disallow /blog/wp-content/themes
Disallow /blog/trackback
Disallow /blog/comments
Disallow /blog/category/*/*
Disallow /blog/*/trackback
Disallow /blog/*/comments
Disallow /blog/*?*
Disallow /blog/*?
Allow /blog/wp-content/uploads

googlebot-image

Rule Path
Disallow
Allow /*

mediapartners-google*

Rule Path
Disallow
Allow /*

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

Other Records

Field Value
sitemap https://intu.io/sitemap.xml
sitemap https://intu.io/blog/sitemap_index.xml

Comments

  • www.robotstxt.org
  • Google Image
  • Google AdSense
  • Internet Archiver Wayback Machine
  • digg mirror