globalfamily.in
robots.txt
Robots Exclusion Standard data for globalfamily.in
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | globalfamily.in |
Base Domain | globalfamily.in |
Scan Status | Ok |
Last Scan | 2024-09-25T05:25:12+00:00 |
Next Scan | 2024-10-02T05:25:12+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-25T05:25:12+00:00 |
URL | https://globalfamily.in/robots.txt |
Domain IPs | 162.241.174.180 |
Response IP | 162.241.174.180 |
Found | Yes |
Hash | b59e3e0d3808e1f2fa5477bf3bcf2521a52cbe1524c014ec00034331be6581f4 |
SimHash | 24443ec0d593 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /ads/preferences/ |
Allow | /gpt/ |
Disallow | / |
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /cgi-bin/ |
Disallow | /assets/ |
Disallow | /beta_thora_bachke_rahna/ |
Disallow | /pages_emp/ |
Disallow | /pages_error/ |
Disallow | /pages_funder/ |
Disallow | /pages_guest/ |
Disallow | /pages_helpless/ |
Disallow | /pages_layout/ |
Disallow | /pages_ngo/ |
Disallow | /pages_product/ |
Disallow | /pages_service/ |
Disallow | /pages_student/ |
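
The two groups above both target `*`, and the first contains `Disallow | /` while the second contains `Allow | /`, so the effective result depends on how a crawler combines them. The sketch below is one possible reading, assuming RFC 9309 behaviour: groups for the same user agent are merged, the longest matching path wins, and a tie between Allow and Disallow goes to Allow. The names `RULES` and `is_allowed` are hypothetical, and real crawlers may evaluate these rules differently. None of the listed paths use wildcards, so plain prefix matching is enough here.

```python
# Rules from both `*` groups, in the order listed in the tables above.
RULES = [
    ("allow", "/ads/preferences/"),
    ("allow", "/gpt/"),
    ("disallow", "/"),
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/assets/"),
    ("disallow", "/beta_thora_bachke_rahna/"),
    ("disallow", "/pages_emp/"),
    ("disallow", "/pages_error/"),
    ("disallow", "/pages_funder/"),
    ("disallow", "/pages_guest/"),
    ("disallow", "/pages_helpless/"),
    ("disallow", "/pages_layout/"),
    ("disallow", "/pages_ngo/"),
    ("disallow", "/pages_product/"),
    ("disallow", "/pages_service/"),
    ("disallow", "/pages_student/"),
]


def is_allowed(path, rules=RULES):
    """Assumed RFC 9309-style evaluation: longest match wins, ties go to allow."""
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        # Prefer the longer prefix; on equal length prefer the allow rule.
        if len(prefix) > best_len or (len(prefix) == best_len and allow):
            best_len = len(prefix)
            best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/about", "/gpt/x", "/assets/logo.png", "/pages_ngo/list"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions the equal-length `Allow | /` and `Disallow | /` rules resolve to Allow, so only the explicitly listed directories are blocked.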
Other Records
Field | Value |
---|---|
sitemap | https://www.globalfamily.in/sitemap.xml |
Warnings
- `noindex` is not a known field.
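
A warning like this can be reproduced with a small field check. The sketch below is illustrative only: `KNOWN_FIELDS` is an assumed set (the fields that appear elsewhere in this report), not the scanner's actual list, and `SAMPLE` is a hypothetical file body, since the raw robots.txt content and the path on the `noindex` line are not shown here.

```python
# Assumed set of recognised fields; the scanner's real list is not known.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_txt):
    """Return unrecognised field names, lowercased, in first-seen order."""
    found = []
    for line in robots_txt.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical sample; the Noindex path is invented for illustration.
SAMPLE = """\
User-agent: *
Disallow: /cgi-bin/
Noindex: /pages_error/
Sitemap: https://www.globalfamily.in/sitemap.xml
"""

if __name__ == "__main__":
    for name in unknown_fields(SAMPLE):
        print(f"`{name}` is not a known field.")
```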