abouttheartists.com
robots.txt

Robots Exclusion Standard data for abouttheartists.com

Resource Scan

Scan Details

Site Domain abouttheartists.com
Base Domain abouttheartists.com
Scan Status Ok
Last Scan2024-11-16T18:09:00+00:00
Next Scan 2024-11-23T18:09:00+00:00

Last Scan

Scanned2024-11-16T18:09:00+00:00
URL https://abouttheartists.com/robots.txt
Redirect https://www.abouttheartists.com/robots.txt
Redirect Domain www.abouttheartists.com
Redirect Base abouttheartists.com
Domain IPs 23.253.57.176
Redirect IPs 23.253.57.176
Response IP 23.253.57.176
Found Yes
Hash 925ab83c354a5cb6ecfd105a3fd599a87f871343d6a30447d98464d8776dfb0c
SimHash 6095555265c4

Groups

mediapartners-google

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 4

googlebot
googlebot-image
adsbot-google
adsbot-google-mobile-apps
bingbot
duckduckbot
msnbot
twitterbo
yandex
applebot
facebookexternalhit
oai-searchbot

Product Comment
twitterbo t
Rule Path
Disallow /*/edit
Disallow /*/new
Disallow /*/destroy
Disallow /*/update
Disallow /*/sort
Disallow /movie
Disallow /*/set_default_img
Disallow /*/claim
Disallow /session
Disallow /user
Disallow /articles
Disallow /artists/has_worked_with/*
Disallow /location/set_location
Disallow /production_companies/*/pcps
Disallow /production_works?div_id=*
Disallow /production_engagements
Disallow /venue_metros
Disallow /contribution_characters
Disallow /contributions
Disallow /descriptions
Disallow /jobs
Disallow /job_categories
Disallow /awards
Disallow /award_recipients
Disallow /award_categories/sidebar
Disallow /productions?search
Disallow /production_companies?search
Disallow /plays?search
Disallow /books?search
Disallow /venues?search
Disallow /artists?search
Disallow /artists/same_name

Other Records

Field Value
crawl-delay 4

*

Rule Path
Disallow /*/edit
Disallow /*/new
Disallow /*/destroy
Disallow /*/update
Disallow /*/sort
Disallow /movie
Disallow /*/set_default_img
Disallow /*/claim
Disallow /session
Disallow /user
Disallow /articles
Disallow /*/search*
Disallow /artists/has_worked_with/*
Disallow /contributions/this_just_in
Disallow /location/set_location
Disallow /production_companies/*/pcps
Disallow /production_works?div_id=*
Disallow /production_engagements
Disallow /venue_metros
Disallow /contribution_characters
Disallow /contributions
Disallow /descriptions
Disallow /jobs
Disallow /job_categories
Disallow /awards
Disallow /award_recipients
Disallow /award_categories/sidebar
Disallow /productions?search
Disallow /production_companies?search
Disallow /plays?search
Disallow /books?search
Disallow /venues?search
Disallow /artists?search
Disallow /artists/same_name

Other Records

Field Value
crawl-delay 30

openindexspider
ip-web-crawler.com
ronzoobot
turnitinbot
ahrefsbot
mj12bot
baiduspider
discobot
discoverybot
camontspider
sitebot
findshare bot
sheenbot
ezooms
sosospider
sosospider+
dotbot
panscient.com
sistrix
psbot
wbsearchbot
careerbot
jikespider
seokicks-robot
seokicks
exabot
blexbot
zumbot
yyspider
unisterbot
ccbot
riddler
spbot
xovibot
crazywebcrawler-spider
semrushbot
semrushbot-sa
tineye-bot
tineye-bot-live
memorybot
wotbox
wikido
yeti
maxpointcrawler
addthis.com robot tech.support@clearspring.com
addthis.com
smtbot
sogou web spider
sogou spider
seznambot
megaindex
ltx71
expo9
nextgensearchbot
experibot_v1
wesee_bot
omnibot
ia_archiver
istellabot
siteexplorer
dataprovider
garlikcrawler
dispatch
bubing
cliqzbot
gigabot
g-i-g-a-b-o-t
piplbot
idbot
deusu
contacts-crawler
mozilla/5.0+(compatible;+piplbot;+http://www.pipl.com/bot/)
alphaseobot
barkrowler
daum
mauibot
zoominfobot
a6-indexer
the knowledge ai
alphaseobot
alphaseobot-sa
semanticscholarbot
earwigbot
mojolicious
yandex
archive.org_bot
special_archiver
pinterestbot
googlebot-image
macocu
gptbot
megaindex.ru
dataforseobot
blexbot
gptbot
anthropic-ai
claude-web
ccbot
facebookbot
google-extended
piplbot
senutobot
chatglm-spider
claudebot
perplexitybot

Rule Path
Disallow /

*

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.abouttheartists.com/sitemap_index.xml

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • Please note that any unautorized copying or duplication of material from abouttheartists.com is expressly prohibited
  • Republishing any of our data without express written permission from abouttheartists.com is prohibitted and will be prosecuted
  • Accessing abouttheartist.com from behind a VPN, masking your identity behind IP addresses or User Agents of another service,
  • or in any way attempting to disguise or conceal your identity while accessing our site, is prohibited.
  • If you operate a search engine that will significantly increase our traffic, please contact us for permission to crawl the site.
  • Otherwise, automated access to the site is prohibited.

Warnings

  • 1 invalid line.