menuism.com
robots.txt

Robots Exclusion Standard data for menuism.com

Resource Scan

Scan Details

Site Domain menuism.com
Base Domain menuism.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-09-05T13:22:14+00:00
Next Scan 2024-12-04T13:22:14+00:00

Last Successful Scan

Scanned2024-05-09T13:15:41+00:00
URL https://www.menuism.com/robots.txt
Domain IPs 104.21.59.103, 172.67.223.9
Response IP 172.67.223.9
Found Yes
Hash b2c33ee2fd62519cd9546b51034108f4dc67e69cb689872661b7be441b4432de
SimHash 8329a39d8dab

Groups

ia_archiver
slysearch
turnitinbot
blackwidow
chinaclaw
custo
disco
download demon
ecatch
eirgrabber
emailsiphon
emailwolf
express webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
ichiro
image stripper
image sucker
indy library
interget
internet ninja
jetcar
joc web spider
larbin
leechftp
scoutjet
mass downloader
midown tool
mister pix
navroad
nearsite
netants
netspider
net vampire
netzip
octopus
pagegrabber
papa foto
pavuk
pcbrowser
realdownload
reget
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport pro
voideye
web image collector
web sucker
webauto
webcopier
webfetch
webgo is
webleacher
webreaper
websauger
website extractor
website quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon webspider
zeus
robot
spider
crawl

Rule Path
Disallow /

*

Rule Path
Disallow /mobile/
Disallow /rating/
Disallow /search
Disallow /restaurants/add_review
Disallow /restaurant/edit/
Disallow /links/partner
Disallow /links/goto
Disallow /restaurant/edit_location
Disallow /surveys/
Disallow /owner/index/
Disallow /tries/add/
Disallow /favorite/add/
Disallow /images/edit_image/
Disallow /post/tip/
Disallow /more_contact_info
Disallow /maps/
Disallow /upload_images/
Disallow /ratewhatyouate
Disallow /sms/
Disallow /wdgtfrm

slurp

Rule Path
Disallow /*mp%3D
Disallow /*/by-order_
Disallow /*/by-sort_
Disallow /*/by-filter_
Disallow /mobile/
Disallow /rating/
Disallow /search
Disallow /restaurants/add_review
Disallow /restaurant/edit/
Disallow /links/partner
Disallow /links/goto
Disallow /restaurant/edit_location
Disallow /surveys/
Disallow /owner/index/
Disallow /tries/add/
Disallow /favorite/add/
Disallow /images/edit_image/
Disallow /post/tip/
Disallow /more_contact_info
Disallow /maps/
Disallow /upload_images/
Disallow /ratewhatyouate
Disallow /sms/

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path Comment
Disallow /*mp%3D -
Disallow /*/by-order_ -
Disallow /*/by-sort_ -
Disallow /*/by-filter_ -
Disallow /mobile/ -
Disallow /rating/ -
Disallow /search -
Disallow /restaurants/add_review -
Disallow /restaurant/edit/ -
Disallow /links/partner -
Disallow /links/goto -
Disallow /restaurant/edit_location -
Disallow /surveys/ -
Disallow /owner/index/ -
Disallow /tries/add/ -
Disallow /favorite/add/ -
Disallow /images/edit_image/ -
Disallow /post/tip/ -
Disallow /more_contact_info -
Disallow /maps/ -
Disallow /upload_images/ -
Disallow /ratewhatyouate -
Disallow /sms/ -
Disallow /1009297/ GPT

msnbot

Rule Path
Disallow /*mp%3D
Disallow /*/by-order_
Disallow /*/by-sort_
Disallow /*/by-filter_
Disallow /mobile/
Disallow /rating/
Disallow /search
Disallow /restaurants/add_review
Disallow /restaurant/edit/
Disallow /links/partner
Disallow /links/goto
Disallow /restaurant/edit_location
Disallow /surveys/
Disallow /owner/index/
Disallow /tries/add/
Disallow /favorite/add/
Disallow /images/edit_image/
Disallow /post/tip/
Disallow /more_contact_info
Disallow /maps/
Disallow /upload_images/
Disallow /ratewhatyouate
Disallow /sms/

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • robots we really don't want
  • Disallow: /cities/home
  • some noindex in head - like for pagination and sort order
  • Disallow: /cities/home
  • Disallow: /cities/home
  • Disallow: /cities/home

Warnings

  • 1 invalid line.