gw.geneanet.org
robots.txt
Robots Exclusion Standard data for gw.geneanet.org
Resource Scan
Scan Details
Site Domain | gw.geneanet.org |
Base Domain | geneanet.org |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-08-03T13:10:11+00:00 |
Next Scan | 2025-11-01T13:10:11+00:00 |
Last Successful Scan
Scanned | 2024-12-30T13:08:00+00:00 |
URL | https://gw.geneanet.org/robots.txt |
Domain IPs | 104.18.35.132, 172.64.152.124, 2606:4700:4400::6812:2384, 2606:4700:4400::ac40:987c |
Response IP | 104.18.35.132 |
Found | Yes |
Hash | b712335bfc76dabaac7b1dc0447e2d185bc368634c4b4b4f85c609ae9b8cab09 |
SimHash | 65b1084473be |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /pdf/ |
Disallow | /treeprint/ |
Disallow | /*m%3D |
Disallow | /*m%3DNG |
Allow | /*m%3DN |
Allow | /*m%3DP |
Disallow | /*color%3D |
Disallow | /*carto%3D |
Disallow | /*%26s1%3D |
Disallow | /*%26s2%3D |
Disallow | /*%26s3%3D |
Disallow | /*ei%3D |
Disallow | /*%26dag%3D |
Disallow | /*templ%3D |
Disallow | /*type%3Dgraph |
Disallow | /*type%3Dlinks |
Disallow | /*type%3Dcarto |
Disallow | /*type%3Dstats |
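
The table mixes overlapping `Allow` and `Disallow` patterns (for example `/*m%3D`, `/*m%3DN`, and `/*m%3DNG`), so the outcome for a given URL depends on which rule takes precedence. The sketch below shows one way to evaluate these rules using the longest-match convention described in RFC 9309, where the most specific (longest) matching pattern wins and `Allow` wins a tie. It rests on several assumptions: the request path is compared literally in the same percent-encoded form shown in the table, the sample paths are hypothetical, and the names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of any scanner or crawler.

```python
import re

# Rules copied from the table above as (directive, path pattern) pairs.
RULES = [
    ("Disallow", "/pdf/"),
    ("Disallow", "/treeprint/"),
    ("Disallow", "/*m%3D"),
    ("Disallow", "/*m%3DNG"),
    ("Allow", "/*m%3DN"),
    ("Allow", "/*m%3DP"),
    ("Disallow", "/*color%3D"),
    ("Disallow", "/*carto%3D"),
    ("Disallow", "/*%26s1%3D"),
    ("Disallow", "/*%26s2%3D"),
    ("Disallow", "/*%26s3%3D"),
    ("Disallow", "/*ei%3D"),
    ("Disallow", "/*%26dag%3D"),
    ("Disallow", "/*templ%3D"),
    ("Disallow", "/*type%3Dgraph"),
    ("Disallow", "/*type%3Dlinks"),
    ("Disallow", "/*type%3Dcarto"),
    ("Disallow", "/*type%3Dstats"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: `path` is already in the same percent-encoded form as the
    patterns. The longest matching pattern wins; Allow wins a tie; a path
    that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "Allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


# Hypothetical paths, written in the encoded form used by the table.
for sample in ["/user?m%3DN%26v%3Dsmith", "/user?m%3DNG%26v%3Dsmith", "/pdf/x.pdf"]:
    print(sample, "->", "allowed" if is_allowed(sample) else "blocked")
```

Under these assumptions, a path containing `m%3DN` is allowed because `/*m%3DN` is longer than the general `/*m%3D` block, while a path containing `m%3DNG` is blocked again by the still longer `/*m%3DNG`. A real crawler would also need to normalize percent-encoding on both sides before comparing, which this sketch deliberately leaves out.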