technoogies.com
robots.txt

Robots Exclusion Standard data for technoogies.com

Resource Scan

Scan Details

Site Domain technoogies.com
Base Domain technoogies.com
Scan Status Ok
Last Scan 2025-04-12T19:15:12+00:00
Next Scan 2025-04-19T19:15:12+00:00

Last Scan

Scanned 2025-04-12T19:15:12+00:00
URL https://technoogies.com/robots.txt
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.32.1
Found Yes
Hash b8c3be3b4dd7b5be554ebfdc2e78caeabb9f348b2b2cbd8e7e0f6c3cd85ac3cb
SimHash 580498899ed7

Groups

*
*

Rule Path
Disallow /*blackhole
Disallow /?blackhole
Disallow /wp-admin/
Disallow /likes/
Disallow /backups-dup-pro/
Disallow /wp-login.php
Allow /wp-admin/admin-ajax.php
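
The rules above mix Disallow and Allow entries for overlapping paths, so the result for a given URL depends on how the parser resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie) and `*` treated as a wildcard. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative only and are not part of the scan tool.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/*blackhole"),
    ("disallow", "/?blackhole"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/likes/"),
    ("disallow", "/backups-dup-pro/"),
    ("disallow", "/wp-login.php"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Every other character (including '?') is matched literally.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best_len = -1
    allowed = True  # assumption: a path that matches no rule is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays fetchable because the Allow pattern is longer than `Disallow /wp-admin/`, while every other path under `/wp-admin/` is blocked. Parsers that apply rules in file order instead may reach a different result.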

agentic
ai article writer
ai content detector
ai dungeon
ai search engine
ai seo crawler
ai writer
ai21 labs
ai2bot
aibot
aimatrix
aisearchbot
ai training
aitraining
alexa
alpha ai
alphaai
amazon bedrock
amazon-kendra
amazon lex
amazon comprehend
amazon sagemaker
amazon silk
amazon textract
amazonbot
amelia
anderspinkbot
anthropic
anypicker
anyword
applebot
aria browse
articoolo
automated writer
awariorssbot
awariosmartbot
bardbot
bingai
bingbot-chat
brave leo
bytedance
bytespider
catboost
cc-crawler
ccbot
chatglm
chinchilla
claude
clearscope
cohere
common crawl
commoncrawl
content harmony
content king
content optimizer
content samurai
contentatscale
contentbot
contentedge
conversion ai
copilot
copyai
copymatic
copyscape
cotoyogi
crawlq ai
crawlspace
crew ai
crewai
dall-e
dataforseobot
dataprovider
deepai
deepl
deepmind
deepseek
diffbot
doubao ai
duckassistbot
facebookbot
facebookexternalhit
firecrawl
flyriver
frase ai
friendlycrawler
gemini
gemma
genai
google bard ai
google-cloudvertexbot
google-extended
googleother
goose
gpt
grammarly
grendizer
grok
gt bot
gtbot
hemingway editor
hugging face
hypotenuse ai
iaskspider
icc-crawler
imagesiftbot
img2dataset
ink editor
inkforall
intelliseek
inferkit
isscyberriskcrawler
jasperai
kafkai
kangaroo
keyword density ai
komobot
llama
magpie-crawler
marketmuse
meltwater
meta ai
meta-ai
meta-external
metaai
metatagbot
mistral
narrative
neevabot
neural text
neuralseo
oai-searchbot
omgili
open ai
openai
openbot
opentext ai
outwrite
page analyzer ai
pangubot
paperlibot
paraphraser.io
perplexitybot
petalbot
phindbot
piplbot
prowritingaid
quillbot
robotspider
rytr
saplingai
scalenut
scraper
scrapy
scriptbook
seo content machine
seo robot
sentibot
sidetrade
simplified ai
skydancer
slickwrite
spin rewriter
spinbot
stability
stablediffusionbot
sudowrite
surfer ai
text blaze
textcortex
the knowledge ai
timpibot
vidnami ai
webzio
whisper
wordai
wordtune
wormsgtp
wpbot
writecream
writerzen
writescope
writesonic
xai
xbot
youbot
zero gtp
zerochat
zimm

Rule Path
Disallow /
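
The group above applies `Disallow /` to every listed agent, so a crawler's first step is deciding whether one of those tokens applies to it. Parsers differ here: some compare the product token exactly, others look for the token as a substring of the crawler's name. The sketch below assumes case-insensitive substring matching and uses only a handful of tokens from the list; `BLOCKED_TOKENS` and `select_group` are hypothetical names, and the user-agent strings in the usage note are examples only.

```python
# A few tokens taken from the blocked group above; the real list is much longer.
BLOCKED_TOKENS = ["anthropic", "bytespider", "ccbot", "claude", "gpt", "perplexitybot"]


def select_group(user_agent, blocked_tokens=BLOCKED_TOKENS):
    """Return "blocked" if any listed token occurs in the user agent, else "*".

    Assumption: matching is a case-insensitive substring test. A parser that
    requires an exact product-token match would behave differently.
    """
    ua = user_agent.lower()
    for token in blocked_tokens:
        if token in ua:
            return "blocked"
    return "*"
```

With this matching rule, a name such as `GPTBot/1.0` falls into the blocked group because it contains `gpt`, while a name that contains none of the tokens falls back to the `*` group and its narrower rules.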

Other Records

Field Value
sitemap https://technoogies.com/sitemap_index.xml
sitemap https://technoogies.com/sitemap-translate.xml
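
Sitemap records like the two above sit outside any user-agent group, so they can be collected with a single pass over the file. This is a rough sketch, assuming `Field: value` lines with `#` comments; `extract_sitemaps` is an illustrative helper, not the scanner's own code.

```python
def extract_sitemaps(robots_txt):
    """Collect the values of all Sitemap records in a robots.txt body."""
    sitemaps = []
    for line in robots_txt.splitlines():
        # Drop trailing comments, then split the field name from its value
        # at the first colon only, so the colon in "https://" is preserved.
        line = line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            sitemaps.append(value.strip())
    return sitemaps
```

Field names are compared case-insensitively here, so `Sitemap:` and `sitemap:` are treated the same way.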

Comments

  • 2025-03-20
  • Blackhole for Bots
  • General Disallow Rules
  • Disallow Bad Bots & AI Bots
  • Ultimate AI Block List v1.3 20250310
  • https://perishablepress.com/ultimate-ai-block-list/
  • Other Bot Rules
  • User-agent: facebookexternalhit
  • Allow: /*?*smid=
  • User-agent: Twitterbot
  • Allow: /*?*smid=

Warnings

  • 1 invalid line.