tocusinfotech.com
robots.txt

Robots Exclusion Standard data for tocusinfotech.com

Resource Scan

Scan Details

Site Domain tocusinfotech.com
Base Domain tocusinfotech.com
Scan Status Ok
Last Scan2025-10-21T10:31:40+00:00
Next Scan 2025-10-28T10:31:40+00:00

Last Scan

Scanned2025-10-21T10:31:40+00:00
URL https://tocusinfotech.com/robots.txt
Domain IPs 104.21.41.92, 172.67.190.164, 2606:4700:3033::6815:295c, 2606:4700:3037::ac43:bea4
Response IP 104.21.41.92
Found Yes
Hash 9114ae70203a2a0ea6c68ed009b476097c2560ed87d82741f3417e3af5570ff7
SimHash ae5b7847e5f1

Groups

googlebot

Rule Path
Disallow
Disallow /cgi-bin

bingbot

Rule Path
Disallow
Disallow /cgi-bin

slurp

Rule Path
Disallow
Disallow /cgi-bin

baiduspider

Rule Path
Disallow
Disallow /cgi-bin

yandexbot

Rule Path
Disallow
Disallow /cgi-bin

facebot

Rule Path
Disallow
Disallow /cgi-bin

ia_archiver

Rule Path
Disallow
Disallow /cgi-bin

mj12bot

Rule Path
Disallow
Disallow /cgi-bin

twitterbot

Rule Path
Disallow
Disallow /cgi-bin

*

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

Other Records

Field Value
sitemap https://tocusinfotech.com/sitemap.xml

Comments

  • Begin Attracta SEO Tools Sitemap. Do not remove
  • sitemap: http://cdn.attracta.com/sitemap/5292003.xml.gz
  • End Attracta SEO Tools Sitemap. Do not remove