Robots Exclusion Standard data for st1.trov.it

Resource Scan

Scan Details

Site Domain st1.trov.it
Base Domain trov.it
Scan Status Ok
Last Scan 2024-10-26T14:56:18+00:00
Next Scan 2024-11-02T14:56:18+00:00

Last Scan

Scanned 2024-10-26T14:56:18+00:00
URL https://st1.trov.it/robots.txt
Domain IPs 18.155.68.117, 18.155.68.15, 18.155.68.25, 18.155.68.7
Response IP 18.155.68.117
Found Yes
Hash 7408bc38a36e8c91aa8e6e4c05835956b04233ac8a6ad3e31198f5aceeba033f
SimHash e21f71808770

Groups

*

Rule Path
Disallow /redirect/
Disallow /scripts/redirect.php/
Disallow /index.php/
Allow /index.php/cod.get_premium/
Allow /index.php/cod.get_similars_popup/
Allow /index.php/cod.get_main_email_domains/
Disallow /index.php/cod.AdsenseClickTracking/
Disallow /rd/
Disallow /rss/
Disallow /listing/
Disallow /details/
Disallow /project/
Disallow /publisher/
Disallow /afc/
Disallow /notifications
Disallow /index.php/cod.mail_preferences/
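
The `*` group mixes a broad `Disallow /index.php/` with narrower `Allow` rules, so the outcome for a given path depends on how a crawler resolves overlapping matches. The sketch below assumes the longest-match rule described in RFC 9309 (the most specific matching path wins, `Allow` wins a tie, and no match means allowed) and plain prefix matching without wildcards. The names `STAR_RULES` and `is_allowed` are hypothetical and exist only for illustration; real crawlers may resolve these rules differently.

```python
# Rules of the "*" group, copied from the table above in the same order.
STAR_RULES = [
    ("disallow", "/redirect/"),
    ("disallow", "/scripts/redirect.php/"),
    ("disallow", "/index.php/"),
    ("allow", "/index.php/cod.get_premium/"),
    ("allow", "/index.php/cod.get_similars_popup/"),
    ("allow", "/index.php/cod.get_main_email_domains/"),
    ("disallow", "/index.php/cod.AdsenseClickTracking/"),
    ("disallow", "/rd/"),
    ("disallow", "/rss/"),
    ("disallow", "/listing/"),
    ("disallow", "/details/"),
    ("disallow", "/project/"),
    ("disallow", "/publisher/"),
    ("disallow", "/afc/"),
    ("disallow", "/notifications"),
    ("disallow", "/index.php/cod.mail_preferences/"),
]


def is_allowed(path, rules=STAR_RULES):
    """Return True if `path` may be fetched under longest-match semantics.

    Assumption: simple prefix matching only (no '*' or '$' handling),
    which is sufficient because the rules above contain no wildcards.
    """
    best_len = -1        # length of the most specific matching rule so far
    best_allow = True    # default when nothing matches: allowed
    for kind, prefix in rules:
        if path.startswith(prefix):
            allow = kind == "allow"
            # A longer prefix is more specific; on equal length, Allow wins.
            if len(prefix) > best_len or (len(prefix) == best_len and allow):
                best_len, best_allow = len(prefix), allow
    return best_allow


if __name__ == "__main__":
    for p in ("/", "/index.php/foo", "/index.php/cod.get_premium/", "/notifications/x"):
        print(p, is_allowed(p))
```

Under these assumptions, `/index.php/cod.get_premium/` is allowed even though `/index.php/` is disallowed, because the `Allow` path is longer. A parser that applies the first matching rule in file order would instead treat that path as disallowed, since `Disallow /index.php/` appears first.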

timpibot

Rule Path
Disallow /

wijubot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

bloglines/3.1

Rule Path
Disallow /

jyxobot/1

Rule Path
Disallow /

cityreview

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

sirdatabot

Rule Path
Disallow /

integralads

Rule Path
Disallow /

ttd-content

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

gumgum

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

adstxtcrawlertp

Rule Path
Disallow /

bidswitchbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /
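
Every named group above carries the single rule `Disallow /`, so for those agents the only question is which group applies. A minimal sketch of that selection step follows, assuming case-insensitive matching of the crawler's product token against the group names and a fallback to the `*` group. The set holds only a subset of the agents listed above, and `FULLY_BLOCKED`, `select_group`, and `is_fully_blocked` are hypothetical names introduced for illustration.

```python
# Subset of the agent tokens listed above; each has the single rule "Disallow /".
FULLY_BLOCKED = {
    "ahrefsbot",
    "amazonbot",
    "bytespider",
    "claudebot",
    "dotbot",
    "httrack",
    "mj12bot",
    "petalbot",
    "wget",
}


def select_group(product_token):
    """Return the group name that applies to a crawler's product token.

    Assumption: exact, case-insensitive token comparison; anything not
    named explicitly falls back to the catch-all "*" group.
    """
    token = product_token.strip().lower()
    return token if token in FULLY_BLOCKED else "*"


def is_fully_blocked(product_token):
    """True if the token selects one of the 'Disallow /' groups."""
    return select_group(product_token) != "*"


if __name__ == "__main__":
    for name in ("ClaudeBot", "AhrefsBot", "Googlebot"):
        print(name, "->", select_group(name))
```

A crawler that lands in the `*` group is still subject to the path rules of that group; only the named agents are excluded from the whole site.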

Comments

  • Classic robots.txt file
  • Common bots to block