techhumanit.com
robots.txt

Robots Exclusion Standard data for techhumanit.com

Resource Scan

Scan Details

Site Domain techhumanit.com
Base Domain techhumanit.com
Scan Status Ok
Last Scan2025-10-27T23:03:34+00:00
Next Scan 2025-11-26T23:03:34+00:00

Last Scan

Scanned2025-10-27T23:03:34+00:00
URL https://techhumanit.com/robots.txt
Domain IPs 104.18.211.89
Response IP 104.18.211.89
Found Yes
Hash c028f36b62dd61d8f0dfc2bf844f4a83006c0b49348ee295c04e9baa0e15000b
SimHash 6122dea22203

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /?s=
Disallow /search/

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /?s=
Disallow /search/

Other Records

Field Value
crawl-delay 10

mj12bot/v1.4.8

Rule Path
Disallow /

jooblebot/2.0

Rule Path
Disallow /

companybook-crawler

Rule Path
Disallow /

yandexbot/3.0

Rule Path
Disallow /

alphabot/3.2

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /?s=
Disallow /search/

Other Records

Field Value
crawl-delay 10

mj12bot/v1.4.8

Rule Path
Disallow /

jooblebot/2.0

Rule Path
Disallow /

companybook-crawler

Rule Path
Disallow /

yandexbot/3.0

Rule Path
Disallow /

alphabot/3.2

Rule Path
Disallow /

dotbot/1.1

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /