spreadsheet123.com
robots.txt

Robots Exclusion Standard data for spreadsheet123.com

Resource Scan

Scan Details

Site Domain spreadsheet123.com
Base Domain spreadsheet123.com
Scan Status Ok
Last Scan2024-09-29T19:29:14+00:00
Next Scan 2024-10-06T19:29:14+00:00

Last Scan

Scanned2024-09-29T19:29:14+00:00
URL https://spreadsheet123.com/robots.txt
Redirect https://www.spreadsheet123.com/robots.txt
Redirect Domain www.spreadsheet123.com
Redirect Base spreadsheet123.com
Domain IPs 45.76.15.252
Redirect IPs 45.76.15.252
Response IP 45.76.15.252
Found Yes
Hash 45e567f8908b3a6377bcae4d1ee6aee76733aec6179ee12a25bafcc0296c315a
SimHash 9314dc164d12

Groups

mediapartners-google

Rule Path
Disallow

*

Rule Path
Disallow /blog/wp-login.php
Disallow /blog/*wp-login.php*
Disallow /blog/trackback
Disallow /blog/*/trackback/$
Disallow /blog/cgi-bin
Disallow /blog/search
Disallow /blog/rss
Disallow /blog/tag/*
Disallow /blog/tag
Disallow /blog/comments/feed
Disallow /blog/comments
Disallow /blog/login/
Disallow /blog/feed
Disallow /blog/feed/$
Disallow /blog/*/feed/$
Disallow /blog/*/feed/rss/$

atspider

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

dsurf

Rule Path
Disallow /

elitesys entry

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

mail sweeper

Rule Path
Disallow /

munky

Rule Path
Disallow /

roverbot

Rule Path
Disallow /

webemailextrac

Rule Path
Disallow /

xget

Rule Path
Disallow /

wget

Rule Path
Disallow /

webwalk

Rule Path
Disallow /

webvac

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webmirror

Rule Path
Disallow /

webfetcher

Rule Path
Disallow /

webcopy

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcatcher

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

w3mir

Rule Path
Disallow /

vobsub

Rule Path
Disallow /

templeton

Rule Path
Disallow /

ssearcher100

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

shai'hulud

Rule Path
Disallow /

pbwf

Rule Path
Disallow /

lightningdownload

Rule Path
Disallow /

kdd exploror

Rule Path
Disallow /

jeeves

Rule Path
Disallow /

internet explore

Rule Path
Disallow /

infospiders

Rule Path
Disallow /

httrack

Rule Path
Disallow /

havindex

Rule Path
Disallow /

geturl

Rule Path
Disallow /

getbot

Rule Path
Disallow /

esirover

Rule Path
Disallow /

download wonder

Rule Path
Disallow /

collage

Rule Path
Disallow /

mozilla/2.0 (compatible; ms frontpage 4.0)

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.spreadsheet123.com/sitemap.xml
sitemap https://www.spreadsheet123.com/blog/sitemap.xml

Comments

  • Disallow Collectors and Spam
  • Disallow Offline Browsers