gw.geneanet.org
robots.txt

Robots Exclusion Standard data for gw.geneanet.org

Resource Scan

Scan Details

Site Domain gw.geneanet.org
Base Domain geneanet.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-08-03T13:10:11+00:00
Next Scan 2025-11-01T13:10:11+00:00

Last Successful Scan

Scanned2024-12-30T13:08:00+00:00
URL https://gw.geneanet.org/robots.txt
Domain IPs 104.18.35.132, 172.64.152.124, 2606:4700:4400::6812:2384, 2606:4700:4400::ac40:987c
Response IP 104.18.35.132
Found Yes
Hash b712335bfc76dabaac7b1dc0447e2d185bc368634c4b4b4f85c609ae9b8cab09
SimHash 65b1084473be

Groups

*

Rule Path
Disallow /pdf/
Disallow /treeprint/
Disallow /*m%3D
Disallow /*m%3DNG
Allow /*m%3DN
Allow /*m%3DP
Disallow /*color%3D
Disallow /*carto%3D
Disallow /*%26s1%3D
Disallow /*%26s2%3D
Disallow /*%26s3%3D
Disallow /*ei%3D
Disallow /*%26dag%3D
Disallow /*templ%3D
Disallow /*type%3Dgraph
Disallow /*type%3Dlinks
Disallow /*type%3Dcarto
Disallow /*type%3Dstats

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5