earthist.in
robots.txt
Robots Exclusion Standard data for earthist.in
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | earthist.in |
Base Domain | earthist.in |
Scan Status | Ok |
Last Scan | 2024-09-07T00:45:03+00:00 |
Next Scan | 2024-10-07T00:45:03+00:00 |
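The two timestamps in the scan details are ISO 8601 values with a UTC offset, so the gap between them can be checked directly. The sketch below parses both and computes the difference. The 30-day rescan interval is inferred from these two values only; the report does not state it, and `ASSUMED_INTERVAL` is a hypothetical name.

```python
from datetime import datetime, timedelta

# Timestamps copied from the scan details.
last_scan = datetime.fromisoformat("2024-09-07T00:45:03+00:00")
next_scan = datetime.fromisoformat("2024-10-07T00:45:03+00:00")

# Gap between the two scans as reported.
interval = next_scan - last_scan
print(interval.days)

# Assumption: the scanner reschedules on a fixed 30-day cycle.
# This is inferred from the two values above, not documented by the report.
ASSUMED_INTERVAL = timedelta(days=30)
predicted = last_scan + ASSUMED_INTERVAL
print(predicted.isoformat())
```

A calendar-month schedule would produce the same pair of dates here, since September has 30 days, so this single sample cannot distinguish the two.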
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-07T00:45:03+00:00 |
URL | https://earthist.in/robots.txt |
Domain IPs | 192.0.78.155, 192.0.78.205 |
Response IP | 192.0.78.155 |
Found | Yes |
Hash | daa67588c59fed6d45aa9edd784fd27e4363c958393195b10b194d2805530cc4 |
SimHash | eb810802ed93 |
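The Hash value is 64 hexadecimal characters (256 bits), which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check could be sketched as follows. `body_digest` and `sample` are hypothetical; the sample bytes are not the real file, so its digest is not expected to equal the reported one.

```python
import hashlib

def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not say which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()

# Value copied from the Hash row of the report.
reported = "daa67588c59fed6d45aa9edd784fd27e4363c958393195b10b194d2805530cc4"

# Placeholder bytes, not the actual robots.txt content.
sample = b"User-agent: *\nDisallow: /wp-admin/\n"
digest = body_digest(sample)

# Only the format is comparable here: both are 64 hex characters.
print(len(digest) == len(reported))
```

To detect a change between scans, one would compare the digest of a freshly fetched body against the stored value. The SimHash row is a separate, shorter fingerprint whose construction the report does not describe.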
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wc-logs/ |
Disallow | /wp-content/uploads/woocommerce_transient_files/ |
Disallow | /wp-content/uploads/woocommerce_uploads/ |
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
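The group above blocks `/wp-admin/` but allows one file beneath it. Under RFC 9309 the most specific (longest) matching rule applies, and Allow wins a tie, so `/wp-admin/admin-ajax.php` stays crawlable. The sketch below applies that precedence to the five rules listed. It handles plain path prefixes only (no `*` or `$` patterns, which these rules do not use), and `RULES` and `is_allowed` are hypothetical names.

```python
# Rules copied from the table for the "*" group.
RULES = [
    ("disallow", "/wp-content/uploads/wc-logs/"),
    ("disallow", "/wp-content/uploads/woocommerce_transient_files/"),
    ("disallow", "/wp-content/uploads/woocommerce_uploads/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

def is_allowed(path: str, rules=RULES) -> bool:
    """Longest-match evaluation for plain prefix rules.

    The longest matching prefix decides; on equal length, allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        n = len(prefix)
        if n > best_len or (n == best_len and kind == "allow"):
            best_len = n
            allowed = (kind == "allow")
    return allowed

for p in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/shop/"):
    print(p, is_allowed(p))
```

Python's standard `urllib.robotparser` evaluates rules in file order and stops at the first match, so it can give a different answer for the `admin-ajax.php` path when the Disallow line comes first, as it does here.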
Other Records
Field | Value |
---|---|
sitemap | https://earthist.in/sitemap.xml |
sitemap | https://earthist.in/news-sitemap.xml |
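The sitemap records are independent of any user-agent group and can be read with the standard library. The sketch below rebuilds a robots.txt body from the records in this report and extracts the sitemap URLs with `urllib.robotparser` (`site_maps()` requires Python 3.8 or later). The reconstructed text is an assumption: the original file's exact line order, casing, and comments are not shown in the report.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction from the report's tables, not the verbatim file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-content/uploads/wc-logs/
Disallow: /wp-content/uploads/woocommerce_transient_files/
Disallow: /wp-content/uploads/woocommerce_uploads/
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php

Sitemap: https://earthist.in/sitemap.xml
Sitemap: https://earthist.in/news-sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

sitemaps = parser.site_maps()
for url in sitemaps:
    print(url)
```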