mygwork.com
robots.txt

Robots Exclusion Standard data for mygwork.com

Resource Scan

Scan Details

Site Domain mygwork.com
Base Domain mygwork.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2026-02-20T10:04:28+00:00
Next Scan 2026-04-21T10:04:28+00:00

Last Successful Scan

Scanned 2025-12-21T06:37:35+00:00
URL https://mygwork.com/robots.txt
Domain IPs 104.20.20.26, 172.66.169.181, 2606:4700:10::6814:141a, 2606:4700:10::ac42:a9b5
Response IP 104.20.20.26
Found Yes
Hash e61c7be2ffa797bcd23a5fdf4d508ee89dd1a7de0b7e798dd14f68b157419654
SimHash 0694ab49a074

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /forget-password
Disallow /impersonate
Disallow /onboarding
Disallow /reset-password
Disallow /signin
Disallow /signup
Disallow /verify-account
Disallow /verify-collect-email
Disallow /verify-email
Disallow /connections
Disallow /profile/settings
Disallow /profile/settings*
Disallow /profile/documents
Disallow /profile/organization-profile
Disallow /organizations/*/create-event
Disallow /organizations/*/create-article
Disallow /organizations/*/edit
Disallow /edit-event/*
Disallow /edit-article/*
Disallow /*/jobs/*/apply
Disallow /members
Disallow /members/*
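
The rules above mix plain prefixes with `*` wildcards. As a rough illustration of how a crawler might evaluate them, here is a minimal Python sketch. It assumes the common matching convention (rules match from the start of the path, `*` matches any run of characters, a trailing `$` anchors the end). The helper names `rule_to_regex` and `is_disallowed` are hypothetical, and the test paths are made up for illustration. Because this group has no Allow rules, precedence between rules is not modelled.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/cdn-cgi/", "/forget-password", "/impersonate", "/onboarding",
    "/reset-password", "/signin", "/signup", "/verify-account",
    "/verify-collect-email", "/verify-email", "/connections",
    "/profile/settings", "/profile/settings*", "/profile/documents",
    "/profile/organization-profile",
    "/organizations/*/create-event", "/organizations/*/create-article",
    "/organizations/*/edit", "/edit-event/*", "/edit-article/*",
    "/*/jobs/*/apply", "/members", "/members/*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex.

    Assumption: "*" means "any run of characters" and a trailing "$"
    anchors the end of the path; all other characters are literal.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for path in ["/signin", "/en/signin", "/en/jobs/123/apply", "/members/42"]:
        print(path, is_disallowed(path))
```

Under these assumptions, `/signin` is blocked but a locale-prefixed `/en/signin` is not, while `/en/jobs/123/apply` is blocked by the `/*/jobs/*/apply` rule. Prefix matching also makes `/profile/settings*` and `/members/*` redundant with `/profile/settings` and `/members`, and means `/members` would match any path that merely begins with that string.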

Other Records

Field Value
sitemap https://www.mygwork.com/sitemap.xml
sitemap https://www.mygwork.com/jobs_sitemap.xml
sitemap https://www.mygwork.com/events_sitemap.xml
sitemap https://www.mygwork.com/news_sitemap.xml

Comments

  • www.robotstxt.org/
  • www.google.com/support/webmasters/bin/answer.py?hl=en&answer=156449
  • Last updated: 2024-11-24
  • CDN and technical paths
  • Auth pages (specific pages without locale)
  • Private pages (specific pages without locale)
  • Profile and settings pages (with wildcards for all variations)
  • Job pages with wildcards (applies to all locales)
  • Member pages with wildcards (applies to all locales)
  • Sitemaps - Submit each separately to Google Search Console