newsport.com.ar
robots.txt

Robots Exclusion Standard data for newsport.com.ar

Resource Scan

Scan Details

Site Domain newsport.com.ar
Base Domain newsport.com.ar
Scan Status Ok
Last Scan2024-09-21T20:36:51+00:00
Next Scan 2024-10-21T20:36:51+00:00

Last Scan

Scanned2024-09-21T20:36:51+00:00
URL https://www.newsport.com.ar/robots.txt
Domain IPs 18.155.68.112, 18.155.68.37, 18.155.68.51, 18.155.68.55, 2600:9000:23d2:1400:3:b9e3:a3c0:93a1, 2600:9000:23d2:2000:3:b9e3:a3c0:93a1, 2600:9000:23d2:5600:3:b9e3:a3c0:93a1, 2600:9000:23d2:5800:3:b9e3:a3c0:93a1, 2600:9000:23d2:7600:3:b9e3:a3c0:93a1, 2600:9000:23d2:bc00:3:b9e3:a3c0:93a1, 2600:9000:23d2:e800:3:b9e3:a3c0:93a1, 2600:9000:23d2:f200:3:b9e3:a3c0:93a1
Response IP 18.155.68.51
Found Yes
Hash 261091c2cf7d94096e3040e01e1936f19af24a86224adbb671f150b674a3ad44
SimHash e410cd474dd0

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*

Other Records

Field Value
sitemap https://www.newsport.com.ar/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.

Warnings

  • `noindex` is not a known field.