Robots Exclusion Standard data for founditgulf.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | founditgulf.com |
Base Domain | founditgulf.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error (HTTP 4xx). |
Last Scan | 2024-04-27T05:03:15+00:00 |
Next Scan | 2024-07-26T05:03:15+00:00 |
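The Last Scan and Next Scan timestamps above are exactly 90 days apart. That is consistent with a fixed 90-day rescan interval, though the report does not state one. A minimal check of the arithmetic, using only the two timestamps listed:

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-04-27T05:03:15+00:00")
next_scan = datetime.fromisoformat("2024-07-26T05:03:15+00:00")

# Difference between the scheduled next scan and the last attempt.
interval = next_scan - last_scan
print(interval.days)  # 90
```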
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-09-24T04:34:48+00:00 |
URL | https://founditgulf.com/robots.txt |
Redirect | https://www.founditgulf.com/robots.txt |
Redirect Domain | www.founditgulf.com |
Redirect Base | founditgulf.com |
Domain IPs | 20.198.89.77 |
Redirect IPs | 96.17.96.11, 96.17.96.25 |
Response IP | 23.193.97.10 |
Found | Yes |
Hash | fd7eacf848eaa7e43d7fecd218ef9a69ed26ff45a83a2fb21d67b0c0f6bd601a |
SimHash | ed6b4d060b95 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /seeker/dashboard |
Disallow | /seeker/profile |
Disallow | /pwa/ |
Disallow | /trex/*/ |
Disallow | */middleware/publish/events |
Disallow | /mthinking/ |
Disallow | *track_aor.html |
Disallow | /penguin/api/public/events/new/v2/publish |
Disallow | *?* |
Disallow | /middleware/ |
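Several of the rules above use the `*` wildcard, so a plain prefix comparison is not enough to tell whether a path is blocked. The sketch below is one way to evaluate them, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every pattern is otherwise matched as a prefix. The helper names `pattern_to_regex` and `is_disallowed` are hypothetical, and the test paths are made up for illustration. Because this group has no Allow rules, any matching Disallow pattern is treated as blocking; a full parser would also need longest-match precedence.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/seeker/dashboard",
    "/seeker/profile",
    "/pwa/",
    "/trex/*/",
    "*/middleware/publish/events",
    "/mthinking/",
    "*track_aor.html",
    "/penguin/api/public/events/new/v2/publish",
    "*?*",
    "/middleware/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else is matched literally, starting at the first character.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path (with query string)."""
    # re.match anchors at the start of the string, giving prefix semantics.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


# Illustrative paths only; none of these are taken from the scan data.
for candidate in ["/seeker/dashboard", "/jobs", "/jobs?page=2", "/trex/v1/x"]:
    print(candidate, is_disallowed(candidate))
```

Under these assumptions, the `*?*` rule blocks every URL that carries a query string, which is by far the broadest rule in the group.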
Other Records
Field | Value |
---|---|
sitemap | https://www.founditgulf.com/xmlsitemap/sitemap-index.xml |
sitemap | https://www.founditgulf.com/xmlsitemap/todays-jobs-sitemap.xml |