gccvacancies.com
robots.txt

Robots Exclusion Standard data for gccvacancies.com

Resource Scan

Scan Details

Site Domain gccvacancies.com
Base Domain gccvacancies.com
Scan Status Ok
Last Scan 2026-03-05T02:10:42+00:00
Next Scan 2026-03-12T02:10:42+00:00

Last Scan

Scanned 2026-03-05T02:10:42+00:00
URL https://gccvacancies.com/robots.txt
Domain IPs 86.38.243.102
Response IP 86.38.243.102
Found Yes
Hash aedc831476e8658db8c43fa7b604c80dc728f62bc8417b834f8922b14aa539c9
SimHash e98088226fb3

Groups

*

Rule Path
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /*?add-to-cart=
Disallow /*?*add-to-cart=
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
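
The rules in the `*` group overlap: `/wp-admin/admin-ajax.php` matches both the Disallow for `/wp-admin/` and the more specific Allow. The sketch below shows one way a crawler might resolve this. It assumes the longest-match precedence described in RFC 9309, where the most specific matching rule wins and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan report or of any library.

```python
import re

# Rules copied from the "*" group above, as (rule type, path pattern) pairs.
RULES = [
    ("disallow", "/wp-content/uploads/wc-logs/"),
    ("disallow", "/wp-content/uploads/woocommerce_transient_files/"),
    ("disallow", "/wp-content/uploads/woocommerce_uploads/"),
    ("disallow", "/*?add-to-cart="),
    ("disallow", "/*?*add-to-cart="),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled prefix regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path_and_query, rules=RULES):
    """Return True if the path (including any query string) may be fetched.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path_and_query):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_to_allow = len(pattern) == best_len and allow
            if longer or tie_to_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


print(is_allowed("/wp-admin/"))                      # False
print(is_allowed("/wp-admin/admin-ajax.php"))        # True
print(is_allowed("/shop/?add-to-cart=12"))           # False
print(is_allowed("/shop/?color=red&add-to-cart=12")) # False
print(is_allowed("/jobs/"))                          # True
```

The standard library's `urllib.robotparser` is deliberately not used here: it does not interpret the `*` wildcard, so the two `add-to-cart` patterns would not be evaluated the way this sketch evaluates them.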

Other Records

Field Value
sitemap https://gccvacancies.com/wp-sitemap.xml
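
For reference, the parsed records above can be rendered back into robots.txt syntax. This is only a sketch: the original file's comments, blank lines, and field casing are not shown in the scan, so the output is an approximation and would not be expected to reproduce the recorded hash. The names `GROUPS`, `OTHER_RECORDS`, and `render_robots_txt` are hypothetical.

```python
# Parsed records from the scan, keyed by user-agent.
GROUPS = {
    "*": [
        ("Disallow", "/wp-content/uploads/wc-logs/"),
        ("Disallow", "/wp-content/uploads/woocommerce_transient_files/"),
        ("Disallow", "/wp-content/uploads/woocommerce_uploads/"),
        ("Disallow", "/*?add-to-cart="),
        ("Disallow", "/*?*add-to-cart="),
        ("Disallow", "/wp-admin/"),
        ("Allow", "/wp-admin/admin-ajax.php"),
    ],
}

# Records that sit outside any user-agent group.
OTHER_RECORDS = [("Sitemap", "https://gccvacancies.com/wp-sitemap.xml")]


def render_robots_txt(groups, other_records):
    """Render groups and standalone records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line closes the group
    lines.extend(f"{field}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


print(render_robots_txt(GROUPS, OTHER_RECORDS))
```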