themerge.in
robots.txt
Robots Exclusion Standard data for themerge.in
Resource Scan
Scan Details
Site Domain | themerge.in |
Base Domain | themerge.in |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-09-23T06:39:28+00:00 |
Next Scan | 2025-09-30T06:39:28+00:00 |
Last Successful Scan
Scanned | 2025-09-15T03:52:41+00:00 |
URL | https://themerge.in/robots.txt |
Redirect | https://www.themerge.in/robots.txt |
Redirect Domain | www.themerge.in |
Redirect Base | themerge.in |
Domain IPs | 104.26.12.205, 104.26.13.205, 172.67.74.152, 2606:4700:20::681a:ccd, 2606:4700:20::681a:dcd, 2606:4700:20::ac43:4a98 |
Redirect IPs | 104.26.12.205, 104.26.13.205, 172.67.74.152, 2606:4700:20::681a:ccd, 2606:4700:20::681a:dcd, 2606:4700:20::ac43:4a98 |
Response IP | 104.26.13.205 |
Found | Yes |
Hash | a69e8fed9a4236b31f358a407c367d857e9c9df8e8c5ab739b199f2d6f8192ec |
SimHash | d94dd8c0a01b |
Groups
*
Rule | Path |
---|---|
Disallow | /?s= |
Disallow | /page/*/?s= |
Disallow | /search/ |
Disallow | /wp-json/ |
Disallow | /?rest_route= |
Other Records
Field | Value |
---|---|
sitemap | https://www.themerge.in/sitemap_index.xml |
Comments