99list.in
robots.txt

Robots Exclusion Standard data for 99list.in

Resource Scan

Scan Details

Site Domain 99list.in
Base Domain 99list.in
Scan Status Ok
Last Scan2025-04-09T08:46:20+00:00
Next Scan 2025-04-16T08:46:20+00:00

Last Scan

Scanned2025-04-09T08:46:20+00:00
URL https://99list.in/robots.txt
Domain IPs 68.66.226.119
Response IP 68.66.226.119
Found Yes
Hash 5f97fd2a207dde0a6864ff0294e341c9bedc6f0d6dca87d4e243f458a805198e
SimHash 0f4ab89605f5

Groups

*

Rule Path
Allow /
Disallow /plugins/
Disallow /libs/
Disallow /includes/
Disallow /print*
Disallow /*?sort_by=
Disallow /*%26sort_by%3D
Disallow /*?sort_type=
Disallow /*%26sort_type%3D
Disallow /*confirm.html*
Disallow /*listing-details.html*
Disallow /*print.html*
Disallow /404.html*
Disallow /*listing-remove.html*
Disallow /*body-style.html*

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://99list.in/sitemap.xml

Comments

  • robots.txt
  • Rules generated by the Sitemap plugin
  • Excluded pages:

Warnings

  • `host` is not a known field.