atrapalo.com
robots.txt

Robots Exclusion Standard data for atrapalo.com

Resource Scan

Scan Details

Site Domain atrapalo.com
Base Domain atrapalo.com
Scan Status Ok
Last Scan 2024-06-01T06:24:24+00:00
Next Scan 2024-06-08T06:24:24+00:00

Last Scan

Scanned 2024-06-01T06:24:24+00:00
URL https://atrapalo.com/robots.txt
Redirect https://www.atrapalo.com/robots.txt
Redirect Domain www.atrapalo.com
Redirect Base atrapalo.com
Domain IPs 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133
Redirect IPs 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133
Response IP 199.232.46.133
Found Yes
Hash d80518832cf172c9f352342ec1cbdb04ba0d4a30a515186a904f003a9768ddc2
SimHash 0f36e9e1cfe7

Groups

*

Rule Path
Disallow /USUCerrarSesion/
Disallow *?*zanpid=*
Disallow *?*tduid=*
Disallow *?*affId=*
Disallow /forms
Disallow /home
Disallow */_fp
Disallow /tracking/
Disallow /tracker/
Disallow /widget/
Disallow /newsletter/
Disallow /home
Disallow /miatrapalo/
Disallow /hoteles/feed-rss/
Disallow /hoteles/hide/
Disallow /hoteles/profile/lite/*
Disallow /hoteles/profile/reviews/*
Disallow /casasrurales/
Disallow /crucis/
Disallow /opiniones/
Disallow /vuelos/fechas_flexibles
Disallow /dynamicpackaging/
Disallow /dynamic-packaging/
Disallow /common/photo/map/hoteles*
Disallow /common/photo/hotelmap*
Disallow /transport/fare/
Disallow /vuelos/pre_busqueda
Disallow /ms/
Disallow *_search/
Disallow */results/
Disallow */resultados/
Allow /assets/*/results/*css
Allow /js_new/*/results/*js
Disallow /57344399/
Disallow /9201/
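
The rules in the `*` group mix plain prefixes (`/home`), wildcard patterns (`*/results/`) and more specific Allow entries that override them (`/assets/*/results/*css`). The sketch below shows one way a crawler might evaluate such rules, assuming the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical, and only a subset of the rules above is included; this is an illustration, not the logic of any particular crawler.

```python
import re

# Subset of the "*" group rules listed above: (rule type, path pattern).
RULES = [
    ("disallow", "/home"),
    ("disallow", "*?*zanpid=*"),
    ("disallow", "*/results/"),
    ("allow", "/assets/*/results/*css"),
    ("disallow", "/hoteles/profile/lite/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Without "$", the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, and Allow beats
    Disallow when two matching patterns have the same length.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/home", "/vuelos/results/", "/assets/v2/results/main.css"]:
        print(p, is_allowed(p))
```

Because patterns are prefix matches, `Disallow /home` also covers paths such as `/homepage`; and the stylesheet path stays fetchable only because the Allow pattern is longer than `*/results/`.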

megaindex.ru

Rule Path
Disallow /

npbot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

grub

Rule Path
Disallow /

larbin

Rule Path
Disallow /

bsshoppingcrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webtrafficexpress

Rule Path
Disallow /

vayalacreep

Rule Path
Disallow /

mercator

Rule Path
Disallow /

httrack

Rule Path
Disallow /

pabloelrobot

Rule Path
Disallow /

wget

Rule Path
Disallow /

teleport

Rule Path
Disallow /

superbot

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

nocilla

Rule Path
Disallow /

henrythemiragorobot

Rule Path
Disallow /

harvest-ng/1.0.2

Rule Path
Disallow /

abachobot

Rule Path
Disallow /

bumblebee@relevare.com

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zeus 32297 webster pro v2.9 win32

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openfind data gatherer, openbot/3.0+(robot-response@openfind.com.tw;+http://www.openfind.com.tw/robot.html)

Rule Path
Disallow /

slysearch

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

voyager/1.0

Rule Path
Disallow /

arachnophilia

Rule Path
Disallow /

architextspider

Rule Path
Disallow /

aspider/0.09

Rule Path
Disallow /

auresys/1.0

Rule Path
Disallow /

exabot/2.0

Rule Path
Disallow /

lmspider (lmspider@scansoft.com)

Rule Path
Disallow /

appie 1.1 (www.walhello.com)

Rule Path
Disallow /
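
Each group above applies only to crawlers whose user-agent matches the group name; everything else falls back to the `*` group. A minimal sketch of that selection step follows, assuming a simplified case-insensitive substring comparison (RFC 9309 specifies matching on the crawler's product token, so real parsers may behave differently for names that include versions or contact details). The function name `select_group` and the sample user-agent strings are hypothetical.

```python
# A few of the group names listed above.
GROUP_NAMES = ["*", "baiduspider", "wget", "httrack", "true_robot", "true_robot/1.0"]


def select_group(user_agent, group_names=GROUP_NAMES):
    """Pick the group that applies to `user_agent`.

    Simplifying assumption: a group matches when its name occurs,
    case-insensitively, anywhere in the user-agent string. The longest
    matching name is preferred; "*" is the fallback.
    """
    ua = user_agent.lower()
    matches = [
        name for name in group_names
        if name != "*" and name.lower() in ua
    ]
    if matches:
        return max(matches, key=len)
    return "*" if "*" in group_names else None


if __name__ == "__main__":
    print(select_group("Wget/1.21.3"))
    print(select_group("ExampleCrawler/1.0"))
```

Under this sketch a crawler identifying as `Wget/1.21.3` lands in the `wget` group and is disallowed from the whole site, while an unlisted crawler is governed by the `*` group rules.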

Other Records

Field Value
sitemap https://www.atrapalo.com/sitemaps/sitemap.xml.gz

Comments

  • Google Tag Manager
  • Blocked bots
  • User-agent: MJ12bot/v1.0.7 (http://majestic12.co.uk/bot.php?+)
  • Disallow: /

Warnings

  • 2 invalid lines.