theingroupcareers.com
robots.txt

Robots Exclusion Standard data for theingroupcareers.com

Resource Scan

Scan Details

Site Domain theingroupcareers.com
Base Domain theingroupcareers.com
Scan Status Ok
Last Scan2024-09-08T02:01:02+00:00
Next Scan 2024-10-08T02:01:02+00:00

Last Scan

Scanned2024-09-08T02:01:02+00:00
URL https://www.theingroupcareers.com/robots.txt
Domain IPs 2600:9000:2003:1c00:1b:4c99:b180:93a1, 2600:9000:2003:200:1b:4c99:b180:93a1, 2600:9000:2003:2400:1b:4c99:b180:93a1, 2600:9000:2003:3200:1b:4c99:b180:93a1, 2600:9000:2003:8000:1b:4c99:b180:93a1, 2600:9000:2003:8800:1b:4c99:b180:93a1, 2600:9000:2003:a000:1b:4c99:b180:93a1, 2600:9000:2003:d200:1b:4c99:b180:93a1, 52.84.229.11, 52.84.229.27, 52.84.229.29, 52.84.229.92
Response IP 52.84.229.27
Found Yes
Hash c58006947beeaaeb831381fa114b481cdbcb85691e0986f22b66c5de6fbd3321
SimHash 631141484f94

Groups

*

Rule Path
Disallow /admin$
Disallow /admin/*
Disallow /sa$
Disallow /sa/*
Disallow /api/*
Disallow /users/auth/*
Disallow /sso/*
Disallow /*?*
Disallow /templates/*
Allow /db_assets/production*?t=*
Disallow /job/*/apply
Disallow /job/*/save_job
Disallow /job/*/unsave_job
Disallow /jobs/*/*/*

Other Records

Field Value
sitemap https://www.theingroupcareers.com/sitemap.xml