umbraxenu.no-ip.biz
robots.txt

Robots Exclusion Standard data for umbraxenu.no-ip.biz

Resource Scan

Scan Details

Site Domain	umbraxenu.no-ip.biz
Base Domain	umbraxenu.no-ip.biz
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2024-09-25T04:58:54+00:00
Next Scan	2024-10-25T04:58:54+00:00

Last Successful Scan

Scanned	2024-09-02T04:57:47+00:00
URL	https://umbraxenu.no-ip.biz/robots.txt
Domain IPs	209.216.101.141
Response IP	209.216.101.141
Found	Yes
Hash	1afa4bfc2bd8acd9c0155fe003fd39a33760b06f8156c8e80b2e813786e4fe73
SimHash	241259b18cfe

Groups

mauibot

Rule	Path
Disallow	/

semrushbot

Rule	Path
Disallow	/

semrushbot/2~bl

Rule	Path
Disallow	/

semrushbot-ba

Rule	Path
Disallow	/

vegi bot

Rule	Path
Disallow	/

blexbot

Rule	Path
Disallow	/

seekport crawler

Rule	Path
Disallow	/

domain re-animator bot

Rule	Path
Disallow	/

ltx71

Rule	Path
Disallow	/

garlikcrawler

Rule	Path
Disallow	/

mj12bot

Rule	Path
Disallow	/

qwantify

Rule	Path
Disallow	/

clark-crawler2/nutch

Rule	Path
Disallow	/

dataforseobot

Rule	Path
Disallow	/

megaindex.ru

Rule	Path
Disallow	/

barkrowler/0.9

Rule	Path
Disallow	/

seokicks

Rule	Path
Disallow	/

anthropic-ai

Rule	Path
Disallow	/

amazonbot

Rule	Path
Disallow	/

fidget-spinner-bot

Rule	Path
Disallow	/

thesis-researcg-bot

Rule	Path
Disallow	/

facebookexternalhit

Rule	Path
Disallow	/
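
Each of the named groups above carries a single rule, Disallow /, which blocks that agent from the whole site. A minimal sketch of how such groups could be checked with Python's standard urllib.robotparser follows. The ROBOTS_FRAGMENT text is a hypothetical reconstruction of four of the groups (the original file's exact layout is not shown in this report), and may_fetch is an illustrative helper name.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a few groups from the report; the real
# file's spacing, capitalization, and comments are not reproduced here.
ROBOTS_FRAGMENT = """\
User-agent: semrushbot
Disallow: /

User-agent: mj12bot
Disallow: /

User-agent: anthropic-ai
Disallow: /

User-agent: amazonbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())


def may_fetch(agent: str, url: str = "https://umbraxenu.no-ip.biz/") -> bool:
    """Return True if the parsed rules let `agent` fetch `url`."""
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("anthropic-ai", "MJ12bot/v1.4.8", "SomeOtherBot"):
        print(agent, may_fetch(agent))
```

The fragment leaves out the * group on purpose: the standard-library parser compares rule paths as plain prefixes and does not expand * wildcards inside a path.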

Rule

Path

Disallow

*

Rule	Path
Disallow	/*%26action%3Dedit
Disallow	/*%26printable%3Dyes
Disallow	/*%26mobileaction%3Dtoggle_view_desktop
Disallow	/*%26diff%3D
Disallow	/*%26oldid%3D
Disallow	/*Special%3AUserLogin
Disallow	/*Special%3ARecentChanges%26
Disallow	/*Special%3ARecentChangesLinked%26
Disallow	/*Special%3AWhatLinksHere
Disallow	/*%26action%3Dhistory
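
The * group relies on wildcard patterns, where * stands for any run of characters. The sketch below shows one way such patterns could be matched against a request path. It assumes that the report displays the paths percent-encoded (so %26 stands for & and %3D for =) and that decoding both the pattern and the URL before comparison is an acceptable normalization. The /index.php?title=... URL layout is the MediaWiki default and is assumed here, not confirmed for this site. The names pattern_to_regex and is_disallowed are illustrative.

```python
import re
from urllib.parse import unquote

# Disallow patterns copied from the "*" group in the report.
DISALLOW = [
    "/*%26action%3Dedit",
    "/*%26printable%3Dyes",
    "/*%26mobileaction%3Dtoggle_view_desktop",
    "/*%26diff%3D",
    "/*%26oldid%3D",
    "/*Special%3AUserLogin",
    "/*Special%3ARecentChanges%26",
    "/*Special%3ARecentChangesLinked%26",
    "/*Special%3AWhatLinksHere",
    "/*%26action%3Dhistory",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Turn a robots.txt path pattern into a regex anchored at the start.

    Assumption: the pattern is percent-decoded first, then every "*"
    becomes ".*" and all other characters are matched literally.
    """
    decoded = unquote(pattern)
    body = ".*".join(re.escape(part) for part in decoded.split("*"))
    return re.compile("^" + body)


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches the decoded path."""
    target = unquote(path_and_query)
    return any(rule.match(target) for rule in RULES)


if __name__ == "__main__":
    print(is_disallowed("/index.php?title=Main_Page&action=edit"))
    print(is_disallowed("/index.php?title=Main_Page"))
```

Under these assumptions the RecentChanges patterns only match when a further query parameter follows, because both end in an encoded &.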

Other Records

Field	Value
sitemap	https://umbraxenu.no-ip.biz/sitemap/misc.xml
sitemap	https://umbraxenu.no-ip.biz/sitemap/sitemap-index-my_wiki.xml

Comments

I assumed most search bots would understand a Mediawiki site, but I guess not.
I'd overlook the lack of a required URL in the name, but it keeps trying UserLogin, disallowed below.
Bad bot that doesn't really look at robots.txt
Another idiot with a crawler.
Apparently A.C. Nielsen, and they morph the bot name.
A zombied crawler that tries too hard
A new "service" that I didn't ask for.
The lamest crawler ever, with no stated purpose. Checks robots.txt all the time. Bye-bye!
Ha, it reads robots.txt all the time but pays no attention to its block. Ban-hammer time!
Dubious stated purpose. My site is not your business plan. Experian.
Not seeing my benefit in their grandiose business plan.
Doesn't pay attention to blocks against parameters like edit. Bye-bye.
Doesn't have a link, scans too fast, so sod off!
Not sure what their business model is, but no benefit for me.
Weirdly sticks to Scientology topics
Another one with zero benefit to me in their business plan
Another business that does me no good, and it tries to flip all the switches.
No link, heavy load, extremely dumb crawler.
The Amazon AI harvester is a switch flipper that tries all possible nonsense options.
Another one out of Amazon space. No link? Get lost!
Another one out of Amazon space. No link? Get lost!
I can't think of why Facebook needs to know about my site. Probably an AI harvester.
Since edits aren't allowed without a user id, let's save time and skip it.
Printable has to be redundant.
Google specifically asks for a desktop view and then complains that it doesn't fit a mobile screen. Facepalm!
No point to diff checking.
Are old versions important to search pages? Doubt it.
Login? I would have figured that most search engine crawlers would have "don't be a dick" rules...
Hey Yahoo!, don't be a dick!
Bingbot is being especially stupid with this, and trying every possible combination in rapid fire.
Bingbot is being stupid with this one too. No sign that it understands the output.
It seems that Google doesn't understand a Mediawiki WhatLinksHere page, or worse, thinks it does.
The page is returning the correct result: no links found. I suggest you buy a clue, Google.
And I'm sure that this one is wasted on the search bots too.
Google actually checks this once a day. And yet, I'm not sure they do much with it.
This isn't even my final form!
