weerg.com
robots.txt

Robots Exclusion Standard data for weerg.com

Resource Scan

Scan Details

Site Domain weerg.com
Base Domain weerg.com
Scan Status Ok
Last Scan 2024-09-20T14:40:22+00:00
Next Scan 2024-10-20T14:40:22+00:00

Last Scan

Scanned 2024-09-20T14:40:22+00:00
URL https://weerg.com/robots.txt
Redirect https://www.weerg.com/robots.txt
Redirect Domain www.weerg.com
Redirect Base weerg.com
Domain IPs 108.128.135.120
Redirect IPs 216.137.52.15, 216.137.52.39, 216.137.52.62, 216.137.52.77
Response IP 108.156.22.27
Found Yes
Hash 765497972bc5dfd7a972a6e42ef525d4eb9e503c521b2a776bce1a386b564988
SimHash 706cd470de33

Groups

*

Rule Path
Allow /
Disallow /sample-*
Disallow /blog/sample-*
Disallow /hs-search-results/*
Disallow /myarea/*
Disallow /myarea
Disallow /preventivo-istantaneo/payment*
Disallow /kostenloses-sofortangebot/payment*
Disallow /free-instant-quote/payment*
Disallow /presupuesto-instantaneo-gratuito/payment*
Disallow /devis-gratuit-instantane/payment*
Disallow /_hcms/*
Disallow /hs/cta/*
Disallow /ecm*
Disallow /it/mecspe-hp-weerg
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
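
The rules above can be checked against a URL path with a small matcher. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes RFC 9309 style semantics: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and chosen for illustration only. Python's standard `urllib.robotparser` is not used here because it does not interpret `*` wildcards.

```python
import re

# Rules for the "*" group, copied from the listing above.
RULES = [
    ("allow", "/"),
    ("disallow", "/sample-*"),
    ("disallow", "/blog/sample-*"),
    ("disallow", "/hs-search-results/*"),
    ("disallow", "/myarea/*"),
    ("disallow", "/myarea"),
    ("disallow", "/preventivo-istantaneo/payment*"),
    ("disallow", "/kostenloses-sofortangebot/payment*"),
    ("disallow", "/free-instant-quote/payment*"),
    ("disallow", "/presupuesto-instantaneo-gratuito/payment*"),
    ("disallow", "/devis-gratuit-instantane/payment*"),
    ("disallow", "/_hcms/*"),
    ("disallow", "/hs/cta/*"),
    ("disallow", "/ecm*"),
    ("disallow", "/it/mecspe-hp-weerg"),
    ("disallow", "/_hcms/preview/"),
    ("disallow", "/hs/manage-preferences/"),
    ("disallow", "/hs/preferences-center/"),
    ("disallow", "/*?*hs_preview=*"),
    ("disallow", "/*?*hsCacheBuster=*"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path (path plus query string) may be crawled.

    Assumption: the longest matching pattern wins; Allow wins ties;
    a path matching no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, path in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(path).match(url_path):
            allow = kind == "allow"
            if len(path) > best_len or (len(path) == best_len and allow):
                best_len, best_allow = len(path), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/myarea/orders", "/free-instant-quote/payment/step1",
              "/blog/post?hs_preview=abc", "/ecommerce"]:
        print(p, is_allowed(p))
```

Under these assumptions the broad `Allow /` is overridden whenever a longer Disallow pattern also matches, which is why `/myarea` is blocked while `/free-instant-quote` (without `/payment`) remains crawlable.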

Other Records

Field Value
sitemap https://www.weerg.com/sitemap.xml