theingroupcareers.com
robots.txt
Robots Exclusion Standard data for theingroupcareers.com
Resource Scan
Scan Details
Site Domain | theingroupcareers.com |
Base Domain | theingroupcareers.com |
Scan Status | Ok |
Last Scan | 2024-09-08T02:01:02+00:00 |
Next Scan | 2024-10-08T02:01:02+00:00 |
Last Scan
Scanned | 2024-09-08T02:01:02+00:00 |
URL | https://www.theingroupcareers.com/robots.txt |
Domain IPs | 2600:9000:2003:1c00:1b:4c99:b180:93a1, 2600:9000:2003:200:1b:4c99:b180:93a1, 2600:9000:2003:2400:1b:4c99:b180:93a1, 2600:9000:2003:3200:1b:4c99:b180:93a1, 2600:9000:2003:8000:1b:4c99:b180:93a1, 2600:9000:2003:8800:1b:4c99:b180:93a1, 2600:9000:2003:a000:1b:4c99:b180:93a1, 2600:9000:2003:d200:1b:4c99:b180:93a1, 52.84.229.11, 52.84.229.27, 52.84.229.29, 52.84.229.92 |
Response IP | 52.84.229.27 |
Found | Yes |
Hash | c58006947beeaaeb831381fa114b481cdbcb85691e0986f22b66c5de6fbd3321 |
SimHash | 631141484f94 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin$ |
Disallow | /admin/* |
Disallow | /sa$ |
Disallow | /sa/* |
Disallow | /api/* |
Disallow | /users/auth/* |
Disallow | /sso/* |
Disallow | /*?* |
Disallow | /templates/* |
Allow | /db_assets/production*?t=* |
Disallow | /job/*/apply |
Disallow | /job/*/save_job |
Disallow | /job/*/unsave_job |
Disallow | /jobs/*/*/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.theingroupcareers.com/sitemap.xml |