darwinbox.in
robots.txt

Robots Exclusion Standard data for darwinbox.in

Resource Scan

Scan Details

Site Domain darwinbox.in
Base Domain darwinbox.in
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-06-15T17:52:27+00:00
Next Scan 2024-06-29T17:52:27+00:00

Last Successful Scan

Scanned2022-10-30T17:06:33+00:00
URL http://darwinbox.in/robots.txt
Redirect https://darwinbox.com/robots.txt
Redirect Domain darwinbox.com
Redirect Base darwinbox.com
Response IP 3.109.39.237
Found Yes
Hash ee90d4912cbaa4b32201a7e8fa7901f6bcafc38f79521b9396347cb8139fdef4
SimHash 7945ddaaa1b3

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /blog*
Disallow /readme.html
Disallow /*?utm
Disallow /*.pdf$
Disallow /?s=
Disallow /search/
Disallow /sub-processors.pdf
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://darwinbox.com/sitemap_index.xml
sitemap https://darwinbox.com/page-sitemap.xml