gpblog.nl
robots.txt
Robots Exclusion Standard data for gpblog.nl
Resource Scan
Scan Details
Site Domain | gpblog.nl |
Base Domain | gpblog.nl |
Scan Status | Ok |
Last Scan | 2024-11-13T11:38:37+00:00 |
Next Scan | 2024-11-20T11:38:37+00:00 |
Last Scan
Scanned | 2024-11-13T11:38:37+00:00 |
URL | https://gpblog.nl/robots.txt |
Redirect | https://www.gpblog.com/nl/robots.txt |
Redirect Domain | www.gpblog.com |
Redirect Base | gpblog.com |
Domain IPs | 34.90.55.240 |
Redirect IPs | 34.117.241.175 |
Response IP | 34.117.241.175 |
Found | Yes |
Hash | e2790c7031cd5ee339cb80a5a08efe7223b49aebc4a5cdabd2d64420a95573a0 |
SimHash | 6e412ee4c790 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /requests/ |
Disallow | /nl/zoeken |
Disallow | /en/search |
Disallow | /de/suche |
Disallow | /es/buscar |
Disallow | /fr/recherche |
Disallow | /it/ricerca |
Disallow | /pt-br/pesquisa |
Disallow | */aggregate |
Other Records
Field | Value |
---|---|
sitemap | https://www.gpblog.com/sitemap.xml |
Warnings
- `host` is not a known field.