lasvegasmagazine.com
robots.txt

Robots Exclusion Standard data for lasvegasmagazine.com

Resource Scan

Scan Details

Site Domain lasvegasmagazine.com
Base Domain lasvegasmagazine.com
Scan Status Ok
Last Scan2024-06-08T02:30:23+00:00
Next Scan 2024-06-15T02:30:23+00:00

Last Scan

Scanned2024-06-08T02:30:23+00:00
URL https://lasvegasmagazine.com/robots.txt
Domain IPs 104.18.10.229, 104.18.11.229, 2606:4700::6812:ae5, 2606:4700::6812:be5
Response IP 104.18.11.229
Found Yes
Hash 00fa5eccfffbf6dbe218b70ffdf1d8cdfd6cc0dff2e8fc93682a745775a60242
SimHash c900d818d7b2

Groups

*

Rule Path
Disallow /?*
Disallow /%3A
Disallow /%3A/
Disallow /*rawhtml*
Disallow /r/
Disallow /slideshow_xml/*
Disallow */cdn-cgi/l/email-protection*
Disallow */slideshow_xml/*
Disallow */xml/*
Disallow *inlines/*
Disallow *cdn-cgi/*
Disallow */drudged/*

directcrawler

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

java/1.5.0_11

Rule Path
Disallow /

java/1.4.1_04

Rule Path
Disallow /

gsa-crawler

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

directcrawler

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

java/1.5.0_11

Rule Path
Disallow /

java/1.4.1_04

Rule Path
Disallow /

gsa-crawler

Rule Path
Disallow /

shopwiki
twitterbot

Rule Path
Disallow

Other Records

Field Value
sitemap https://lasvegasmagazine.com/sitemap.xml

Comments

  • other user agents
  • User-agent: ia_archiver-web.archive.org
  • Disallow: /
  • go away
  • User-agent: ia_archiver-web.archive.org
  • Disallow: /
  • Twitter allow