aihp.in
robots.txt

Robots Exclusion Standard data for aihp.in

Resource Scan

Scan Details

Site Domain aihp.in
Base Domain aihp.in
Scan Status Ok
Last Scan 2025-12-06T04:24:28+00:00
Next Scan 2026-01-05T04:24:28+00:00

Last Scan

Scanned 2025-12-06T04:24:28+00:00
URL https://aihp.in/robots.txt
Domain IPs 172.66.41.26, 172.66.42.230, 2606:4700:3108::ac42:291a, 2606:4700:3108::ac42:2ae6
Response IP 172.66.42.230
Found Yes
Hash e16d86273ceab691ebf527204ddfd3aa1b1dc091439060e7f197ba8d0379e7b9
SimHash 4a04da316f40
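
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, though the report does not name the algorithm. As a minimal sketch under that assumption, a content hash like this can be used to tell whether a robots.txt body changed between two scans. The function names `content_hash` and `has_changed` are illustrative and not taken from the scanner.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a fetched robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return content_hash(body) != previous_hash.lower()
```

A scanner could store the digest from each scan and skip re-parsing whenever `has_changed` returns False. Any byte-level difference, including whitespace, changes the digest, which is presumably why a SimHash is reported alongside it for near-duplicate comparison.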

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /cgi-bin/
Disallow /cdn-cgi/l/email-protection
Disallow /*?s=
Disallow /*?replytocom=
Disallow /*?share=
Disallow /*?preview=true
Disallow /*?amp=1
Disallow /*?*utm_*
Disallow /*?*gclid=
Disallow /*?*fbclid=
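
To illustrate how the rules above interact, here is a minimal sketch of a matcher, assuming the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow beats Disallow on a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative; individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" group above, as (rule, path pattern) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/cdn-cgi/l/email-protection"),
    ("disallow", "/*?s="),
    ("disallow", "/*?replytocom="),
    ("disallow", "/*?share="),
    ("disallow", "/*?preview=true"),
    ("disallow", "/*?amp=1"),
    ("disallow", "/*?*utm_*"),
    ("disallow", "/*?*gclid="),
    ("disallow", "/*?*fbclid="),
]


def pattern_to_regex(pattern):
    """Translate a path pattern: '*' is a wildcard, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns match as path prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longest pattern wins; on a tie, allow beats disallow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because the Allow pattern is longer than `/wp-admin/`, while `/jobs/?page=2&utm_source=x` is blocked by `/*?*utm_*`. Note that `/*?s=` only matches when `s` is the first query parameter, so `/page?foo=1&s=2` is not blocked by it.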

Other Records

Field Value
sitemap https://aihp.in/sitemap_index.xml

Comments

  • =========
  • Global
  • =========
  • Advisory pointer for AI crawlers
  • System / admin
  • Site search & duplicate params
  • Block AMP only if you do NOT use AMP. Remove this line if you use AMP.
  • Thin archives -> Prefer Yoast 'noindex' instead of blocking.
  • If you really want to block them, uncomment below.
  • Disallow: /tag/
  • Disallow: /author/
  • Disallow: /job-location/
  • Disallow: /job-type/
  • Allow assets for rendering (do NOT block CSS/JS/images)
  • Sitemaps (Yoast handles automatically, one index is enough)

Warnings

  • `llm` is not a known field.
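
A warning like this typically comes from comparing each record's field name against a list of recognised fields. The following is a minimal sketch of such a check. The `KNOWN_FIELDS` set is an assumption (the scanner's actual list is not given in the report), and the `llm` value in `SAMPLE` is a hypothetical placeholder, since the report does not show the original value.

```python
# Assumed set of recognised field names; the scanner's real list may differ.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay", "host"}

# Hypothetical excerpt: only the field name `llm` is known from the report.
SAMPLE = """\
# Advisory pointer for AI crawlers
llm: https://example.com/llms.txt
User-agent: *
Disallow: /wp-admin/
Sitemap: https://aihp.in/sitemap_index.xml
"""


def unknown_fields(text):
    """Return (line number, field name) for each record with an unrecognised field."""
    found = []
    for number, line in enumerate(text.splitlines(), start=1):
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue  # blank line, comment-only line, or not a record
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            found.append((number, field))
    return found


for number, field in unknown_fields(SAMPLE):
    print(f"line {number}: `{field}` is not a known field.")
```

Crawlers generally ignore records they do not recognise, so a line like this is harmless to standard parsers but has no effect unless a particular crawler chooses to honour it.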