exit.com.ar
robots.txt

Robots Exclusion Standard data for exit.com.ar

Resource Scan

Scan Details

Site Domain exit.com.ar
Base Domain exit.com.ar
Scan Status Ok
Last Scan2024-10-17T06:15:14+00:00
Next Scan 2024-11-16T06:15:14+00:00

Last Scan

Scanned2024-10-17T06:15:14+00:00
URL https://www.exit.com.ar/robots.txt
Domain IPs 13.225.4.41, 13.225.4.50, 13.225.4.56, 13.225.4.9
Response IP 13.225.4.56
Found Yes
Hash cbb3aac019ecf40cf27767fde0656a7ccc4911689d5989c204c1b6636b7ab8fe
SimHash ac905fe4e5f7

Groups

*

Rule Path
Allow /img/*
Disallow /account*
Disallow /login*
Disallow /checkout*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*
Disallow /*?_q=*
Disallow /*map%3D*
Disallow /*map%3Dft
Disallow /*query*
Disallow /*productClusterIds*
Disallow /mobile*
Disallow /*page%3D*
Disallow /?utm_*
Disallow /*map
Disallow /S*
Disallow /s*
Disallow /priceform*

googlebot

Rule Path
Allow /*idsku%3D*
Allow /*skuId%3D*

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-video

Rule Path
Disallow /

googlebot-mobile

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

google storebot

Rule Path
Allow /

google inspectiontool

Rule Path
Allow /

google others

Rule Path
Allow /

google extended

Rule Path
Allow /

bingbot

Rule Path
Allow /

bingpreview

Rule Path
Allow /

yahoo! slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

yandexbot

Rule Path
Allow /

applebot

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

openai

Rule Path
Allow /

baiduspider

Rule Path
Allow /

sogou

Rule Path
Allow /

yeti/naverbot

Rule Path
Allow /

pinterestbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 0.2

Other Records

Field Value
sitemap https://www.exit.com.ar/sitemap.xml
sitemap https://www.chelsea.com.ar/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.
  • Allow specific robots
  • Limit crawl rate to 5 requests per second