global-satinfo.in
robots.txt

Robots Exclusion Standard data for global-satinfo.in

Resource Scan

Scan Details

Site Domain global-satinfo.in
Base Domain global-satinfo.in
Scan Status Ok
Last Scan2025-09-21T19:12:50+00:00
Next Scan 2025-09-28T19:12:50+00:00

Last Scan

Scanned2025-09-21T19:12:50+00:00
URL https://global-satinfo.in/robots.txt
Domain IPs 104.21.46.147, 172.67.140.51, 2606:4700:3032::6815:2e93, 2606:4700:3037::ac43:8c33
Response IP 172.67.140.51
Found Yes
Hash 84a29ba40831c776bf22c18a2a927321431ff813b5ec481d3e0ad3c3dc78882e
SimHash 60001af0213c

Groups

*

Rule Path
Disallow /adm/
Disallow /cache/
Disallow /includes/
Disallow /store/
Disallow /ucp.php
Disallow /mcp.php
Disallow /posting.php
Disallow /report.php
Disallow /cron.php
Disallow /faq.php
Disallow /login
Disallow /logout
Allow /download/file.php*
Allow /memberlist.php?mode=viewprofile*
Allow /assets/
Allow /assets/*.css
Allow /assets/*.js
Allow /images/
Allow /styles/
Allow /styles/*.css
Allow /styles/*.js
Disallow /search.php

Other Records

Field Value
sitemap https://global-satinfo.in/sitemap.xml

Comments

  • Block admin and sensitive areas
  • Allow access to downloadable files
  • Allow profile pages
  • Allow resources needed for proper rendering
  • Block search results to avoid thin/duplicate content
  • Reduce crawl load (optional)
  • Crawl-delay: 5