gpnotebook.com
robots.txt

Robots Exclusion Standard data for gpnotebook.com

Resource Scan

Scan Details

Site Domain gpnotebook.com
Base Domain gpnotebook.com
Scan Status Ok
Last Scan2025-04-13T00:43:33+00:00
Next Scan 2025-05-13T00:43:33+00:00

Last Scan

Scanned2025-04-13T00:43:33+00:00
URL https://gpnotebook.com/robots.txt
Domain IPs 104.26.0.239, 104.26.1.239, 172.67.72.136, 2606:4700:20::681a:1ef, 2606:4700:20::681a:ef, 2606:4700:20::ac43:4888
Response IP 104.26.1.239
Found Yes
Hash 52c43d0580349e913c303752c8251f17eeb4dc2f1d654c6a2f2fe4a805d4e7c3
SimHash 4650cc478615

Groups

*

Rule Path
Allow /

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /jobs/jobsrss
Disallow /jobs/jbequicksignup
Disallow /*sign-in?from=
Disallow /*?nextLocale=
Disallow /*/search?query=
Disallow /*/simpleprocess.cfm?querystring=
Disallow /*?singlesearchstart=
Disallow /*?doublesearchstart=
Disallow /*?triplesearchstart=
Disallow /*refer-id?__hstc=

Other Records

Field Value
sitemap https://gpnotebook.com/en-AU/sitemap-index.xml
sitemap https://gpnotebook.com/en-IE/sitemap-index.xml
sitemap https://gpnotebook.com/en-GB/sitemap-index.xml
sitemap https://gpnotebook.com/fr/sitemap-index.xml
sitemap https://gpnotebook.com/de/sitemap-index.xml
sitemap https://gpnotebook.com/es/sitemap-index.xml
sitemap https://gpnotebook.com/sitemap-index.xml

Comments

  • *
  • *
  • *
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.