trvlland.com
robots.txt

Robots Exclusion Standard data for trvlland.com

Resource Scan

Scan Details

Site Domain trvlland.com
Base Domain trvlland.com
Scan Status Ok
Last Scan2025-09-27T15:17:10+00:00
Next Scan 2025-10-27T15:17:10+00:00

Last Scan

Scanned2025-09-27T15:17:10+00:00
URL https://trvlland.com/robots.txt
Domain IPs 104.21.43.131, 172.67.179.139, 2606:4700:3030::ac43:b38b, 2606:4700:3033::6815:2b83
Response IP 172.67.179.139
Found Yes
Hash 44f81ce9a34a805cc3e174efe0d288296b6ec5f8a3c798b8e3277b4ef9f9e74c
SimHash ed129990db31

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /wp-admin/
Disallow /?
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow */embed
Disallow */page/
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow /wp-content/uploads/*.pdf
Disallow /wp-content/uploads/*.doc
Disallow /wp-content/uploads/*.docx
Disallow /find-adventure/*
Disallow /fr/trouver-l-aventure/*
Disallow /it/trova-l-avventura/*
Disallow /de/finde-abenteuer/*
Allow /wp-content/admin-ajax.php
Allow *min?*
Allow *.js*
Allow *.css*

ahrefssiteaudit

Rule Path
Allow /

ahrefsbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://trvlland.com/sitemap.xml