hawa.jobs
robots.txt

Robots Exclusion Standard data for hawa.jobs

Resource Scan

Scan Details

Site Domain hawa.jobs
Base Domain hawa.jobs
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-17T03:49:59+00:00
Next Scan 2024-12-16T03:49:59+00:00

Last Successful Scan

Scanned2024-05-20T19:39:39+00:00
URL https://www.hawa.jobs/robots.txt
Domain IPs 2600:9000:2003:1000:1c:a282:9cc0:93a1, 2600:9000:2003:1800:1c:a282:9cc0:93a1, 2600:9000:2003:5400:1c:a282:9cc0:93a1, 2600:9000:2003:9800:1c:a282:9cc0:93a1, 2600:9000:2003:ba00:1c:a282:9cc0:93a1, 2600:9000:2003:c200:1c:a282:9cc0:93a1, 2600:9000:2003:d400:1c:a282:9cc0:93a1, 2600:9000:2003:dc00:1c:a282:9cc0:93a1, 52.84.229.119, 52.84.229.44, 52.84.229.50, 52.84.229.95
Response IP 52.84.229.95
Found Yes
Hash 2b22c869c0da1e094f3a8b69aa2aa78305e333f1282b70310c1cd178ea1c4821
SimHash 630341484f94

Groups

*

Rule Path
Disallow /admin$
Disallow /admin/*
Disallow /sa$
Disallow /sa/*
Disallow /api/*
Disallow /users/auth/*
Disallow /sso/*
Disallow /*?*
Disallow /templates/*
Allow /db_assets/production*?t=*
Disallow /job/*/apply
Disallow /job/*/save_job
Disallow /job/*/unsave_job
Disallow /jobs/*/*/*

Other Records

Field Value
sitemap https://www.hawa.jobs/sitemap.xml