gpnotebook.com
robots.txt
Robots Exclusion Standard data for gpnotebook.com
Resource Scan
Scan Details
Site Domain | gpnotebook.com |
Base Domain | gpnotebook.com |
Scan Status | Ok |
Last Scan | 2025-04-13T00:43:33+00:00 |
Next Scan | 2025-05-13T00:43:33+00:00 |
Last Scan
Scanned | 2025-04-13T00:43:33+00:00 |
URL | https://gpnotebook.com/robots.txt |
Domain IPs | 104.26.0.239, 104.26.1.239, 172.67.72.136, 2606:4700:20::681a:1ef, 2606:4700:20::681a:ef, 2606:4700:20::ac43:4888 |
Response IP | 104.26.1.239 |
Found | Yes |
Hash | 52c43d0580349e913c303752c8251f17eeb4dc2f1d654c6a2f2fe4a805d4e7c3 |
SimHash | 4650cc478615 |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
*
Rule | Path |
---|---|
Disallow | /jobs/jobsrss |
Disallow | /jobs/jbequicksignup |
Disallow | /*sign-in?from= |
Disallow | /*?nextLocale= |
Disallow | /*/search?query= |
Disallow | /*/simpleprocess.cfm?querystring= |
Disallow | /*?singlesearchstart= |
Disallow | /*?doublesearchstart= |
Disallow | /*?triplesearchstart= |
Disallow | /*refer-id?__hstc= |
Other Records
Field | Value |
---|---|
sitemap | https://gpnotebook.com/en-AU/sitemap-index.xml |
sitemap | https://gpnotebook.com/en-IE/sitemap-index.xml |
sitemap | https://gpnotebook.com/en-GB/sitemap-index.xml |
sitemap | https://gpnotebook.com/fr/sitemap-index.xml |
sitemap | https://gpnotebook.com/de/sitemap-index.xml |
sitemap | https://gpnotebook.com/es/sitemap-index.xml |
sitemap | https://gpnotebook.com/sitemap-index.xml |
Warnings
- `host` is not a known field.
Comments