govdocs.com
robots.txt

Robots Exclusion Standard data for govdocs.com

Resource Scan

Scan Details

Site Domain govdocs.com
Base Domain govdocs.com
Scan Status Ok
Last Scan2025-08-13T19:54:45+00:00
Next Scan 2025-09-12T19:54:45+00:00

Last Scan

Scanned2025-08-13T19:54:45+00:00
URL https://govdocs.com/robots.txt
Redirect https://www.govdocs.com/robots.txt
Redirect Domain www.govdocs.com
Redirect Base govdocs.com
Domain IPs 104.21.47.105, 172.67.146.191, 2606:4700:3032::ac43:92bf, 2606:4700:3036::6815:2f69
Redirect IPs 104.18.37.69, 172.64.150.187, 2606:4700:4400::6812:2545, 2606:4700:4400::ac40:96bb
Response IP 104.18.37.69
Found Yes
Hash c381c060309b82f700c891e94e16162b2e38ef0f023984640d81098e8666bd32
SimHash a0885a52e438

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /*add-to-cart%3D*

*

Rule Path
Disallow /wp-admin/
Disallow /govdocs-order-form-test-page/
Disallow /govdocs-order-form-old/
Disallow /advance-auto-govdocs-order-form/
Disallow /exxonmobil-automatic-compliance-labor-law-poster-store/
Disallow /ford-motor-company-backup/
Disallow /ford-motor-company/
Disallow /kelly-services/
Disallow /panda-express-order-page/
Disallow /sams-club-walmart-poster-orders/
Disallow /sams-club-walmart-poster-orders-old/
Disallow /trueblue-staffing-poster-order-form/
Disallow /compliance-faq/
Disallow /dashboard-faq/
Disallow /minimum-wage-FAQ/
Disallow /poster-check-faq/
Disallow /posterchecksandbox/
Disallow /postercheck/
Disallow /*/feed/
Disallow /*/?s=*
Allow /feed/atom/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.govdocs.com/sitemap_index.xml

Comments

  • Prevent Crawling Unnecessary Endpoints - Dynamically added by BigScoots