umbraxenu.no-ip.biz
robots.txt

Robots Exclusion Standard data for umbraxenu.no-ip.biz

Resource Scan

Scan Details

Site Domain umbraxenu.no-ip.biz
Base Domain umbraxenu.no-ip.biz
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-25T04:58:54+00:00
Next Scan 2024-10-25T04:58:54+00:00

Last Successful Scan

Scanned2024-09-02T04:57:47+00:00
URL https://umbraxenu.no-ip.biz/robots.txt
Domain IPs 209.216.101.141
Response IP 209.216.101.141
Found Yes
Hash 1afa4bfc2bd8acd9c0155fe003fd39a33760b06f8156c8e80b2e813786e4fe73
SimHash 241259b18cfe

Groups

mauibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot/2~bl

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

clark-crawler2/nutch

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

barkrowler/0.9

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

fidget-spinner-bot

Rule Path
Disallow /

thesis-researcg-bot

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

*

Rule Path
Disallow /*%26action%3Dedit
Disallow /*%26printable%3Dyes
Disallow /*%26mobileaction%3Dtoggle_view_desktop
Disallow /*%26diff%3D
Disallow /*%26oldid%3D
Disallow /*Special%3AUserLogin
Disallow /*Special%3ARecentChanges%26
Disallow /*Special%3ARecentChangesLinked%26
Disallow /*Special%3AWhatLinksHere
Disallow /*%26action%3Dhistory

Other Records

Field Value
sitemap https://umbraxenu.no-ip.biz/sitemap/misc.xml
sitemap https://umbraxenu.no-ip.biz/sitemap/sitemap-index-my_wiki.xml

Comments

  • I assumed most search bots would understand a Mediawiki site, but I guess not.
  • I'd overlook the lack of a required URL in the name, but it keeps trying UserLogin, disallowed below.
  • Bad bot that doesnt really look at robots.txt
  • Another idiot with a crawler.
  • Apparently A.C. Nielsen, and they morph the bot name.
  • A zombied crawler that tries too hard
  • A new "service" that I didn't ask for.
  • The lamest crawler ever, with no stated purpose. Checks robots.txt all the time. Bye-bye!
  • Ha, it reads robots.txt all the time but pays no attention to its block. Ban-hammer time!
  • Dubious stated purpose. My site is not your business plan. Experian.
  • Not seeeing my benefit in their grandiose business plan.
  • Doesn't check pay attention to blocks against parameters like edit. Bye-bye.
  • Doesn't have a link, scans too fast, so sod off!
  • Not sure what their business model is, but no benwfir for me.
  • Weirdly sticks to Sientology topics
  • Another one with zero benefit to me in their business plan
  • Another business that does me no good, and it tries to flip all the switches.
  • No link, heavy load, extremely dumb crawler.
  • The Amazon AI harvester is a switch flipper that tries all possible nonsense options.
  • Another one out of Amazon space. No link? Get lost!
  • Another one out of Amazon space. No link? Get lost!
  • I can't think of why Facebook needs to know about my site. Probably an AI harvester.
  • Since edits aren't allowed without a user id, let's save time and skip it.
  • Printable has to be redundant.
  • Google specifically asks for a desktop view and then complains that it doesn't fit a mobile screen. Facepalm!
  • No point to diff checking.
  • Are old versions important to search pages? Doubt it.
  • Login? I would have figured that most search engine crawlers would have "don't be a dick" rules...
  • Hey Yahoo!, don't be a dick!
  • Bingbot is being especially stupid with this, and trying every possible combination in rapid fire.
  • Bingbot is being stupid with this one too. No sign that it understands the output.
  • It seems that Google doesn't understand a Mediawiki WhatLinksHere page, or worse, thinks it does.
  • The page is returning the correct result no links found. I suggest you buy a clue Google.
  • And I'm sure that this one is wasted on the search bots too.
  • Google actually checks this once a day. And yet, I'm not sure they do much with it.
  • This isn't even my final form!