gccertification.com
robots.txt

Robots Exclusion Standard data for gccertification.com

Resource Scan

Scan Details

Site Domain gccertification.com
Base Domain gccertification.com
Scan Status Ok
Last Scan 2025-09-27T09:45:09+00:00
Next Scan 2025-10-27T09:45:09+00:00

Last Scan

Scanned 2025-09-27T09:45:09+00:00
URL https://gccertification.com/robots.txt
Domain IPs 104.26.14.88, 104.26.15.88, 172.67.71.197, 2606:4700:20::681a:e58, 2606:4700:20::681a:f58, 2606:4700:20::ac43:47c5
Response IP 104.26.14.88
Found Yes
Hash a5fb3e6fe0443ef3fbb66a0e60c16815f7a7f03750eaae59536731bec6bffe0f
SimHash 6900c94009b1
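
The report does not say which algorithm produced the Hash value. Its 64 hexadecimal digits are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body. That is an assumption, as are the helper names sha256_hex and matches_recorded. If the scanner normalizes the file before hashing, a freshly fetched copy will not match even when the content is unchanged.

```python
import hashlib

# Hash value copied from the scan report above.
RECORDED_HASH = "a5fb3e6fe0443ef3fbb66a0e60c16815f7a7f03750eaae59536731bec6bffe0f"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a fetched body against a recorded digest.

    Assumes (not confirmed by the report) that the digest is SHA-256
    over the unmodified response bytes.
    """
    return sha256_hex(body) == recorded.lower()
```

A caller would pass the bytes of the fetched robots.txt to matches_recorded. A False result means either the file changed since the scan or the assumption about the algorithm is wrong.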

Groups

*

Rule Path
Allow /wp-content/uploads/
Disallow /wp-admin/
Disallow /readme.html
Disallow /refer/
Disallow /feed/
Disallow /wp-admin/admin-ajax.php
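
The rules above can be checked against sample paths with Python's standard urllib.robotparser. This is a minimal sketch: the robots.txt lines are reconstructed from the listing, so the exact text and rule order of the original file are assumptions, and the sample paths and the helpers build_parser and is_allowed are hypothetical. Python's parser applies the first matching rule in file order, which may differ from crawlers that use longest-match precedence.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group listed above. The original file's
# exact wording and rule order are assumptions.
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /wp-content/uploads/",
    "Disallow: /wp-admin/",
    "Disallow: /readme.html",
    "Disallow: /refer/",
    "Disallow: /feed/",
    "Disallow: /wp-admin/admin-ajax.php",
]


def build_parser(lines):
    """Parse robots.txt lines without making a network request."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, path, agent="*"):
    """Return True if the agent may fetch the given path on this site."""
    return parser.can_fetch(agent, "https://gccertification.com" + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_LINES)
    # Hypothetical sample paths, chosen only to exercise each rule.
    for path in ("/", "/wp-content/uploads/logo.png", "/wp-admin/",
                 "/wp-admin/admin-ajax.php", "/feed/", "/readme.html"):
        print(path, is_allowed(parser, path))
```

Paths that match no rule, such as the site root, are allowed by default.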

Other Records

Field Value
sitemap https://gccertification.com/sitemap.xml