info.joinjobcorps.com
robots.txt

Robots Exclusion Standard data for info.joinjobcorps.com

Resource Scan

Scan Details

Site Domain info.joinjobcorps.com
Base Domain joinjobcorps.com
Scan Status Ok
Last Scan 2024-10-27T19:14:08+00:00
Next Scan 2024-11-26T19:14:08+00:00

Last Scan

Scanned 2024-10-27T19:14:08+00:00
URL https://info.joinjobcorps.com/robots.txt
Domain IPs 199.60.103.226, 199.60.103.30, 2606:2c40::c73c:671e, 2606:2c40::c73c:67e2
Response IP 199.60.103.226
Found Yes
Hash 0ef69c5b132cf25c1b6728821562b29534fa76b9accf6783a6ef814e1253b7cf
SimHash 7475c638c5b1
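
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. The sketch below shows how such a fingerprint could be computed to detect changes between scans. The function name `fingerprint` and the choice of SHA-256 over the raw response bytes are assumptions, so the result may not reproduce the value listed above.

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: the scanner hashes the unmodified response body with
    SHA-256. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()

def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_hash.lower()
```

Comparing digests only reports that the file changed. It does not say how much, which is presumably the role of the separate SimHash field.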

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
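
Several of the rules above use the `*` wildcard, which Python's standard `urllib.robotparser` does not interpret. The sketch below is a minimal, hypothetical matcher that treats `*` as "any sequence of characters" and a trailing `$` as an end anchor, as described in RFC 9309. It handles only the Disallow rules of the `*` group listed here; the helper names are illustrative, and Allow rules and longest-match precedence are deliberately left out.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/sample-*",
    "/blog/sample-*",
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
    "/*?*hs_preview=*",
    "/*?*hsCacheBuster=*",
]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

RULES = [pattern_to_regex(p) for p in DISALLOW]

def is_disallowed(path: str) -> bool:
    """Return True if the path (including any query string) matches a rule."""
    return any(rule.match(path) for rule in RULES)

if __name__ == "__main__":
    for candidate in ["/sample-page", "/blog/real-post", "/page?hs_preview=abc"]:
        print(candidate, is_disallowed(candidate))
```

Rules without a wildcard, such as `/_hcms/preview/`, act as plain prefix matches, so any path beginning with that string is blocked.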