worldlyinfo.com
robots.txt
Robots Exclusion Standard data for worldlyinfo.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | worldlyinfo.com |
| Base Domain | worldlyinfo.com |
| Scan Status | Ok |
| Last Scan | 2026-02-11T06:03:24+00:00 |
| Next Scan | 2026-02-18T06:03:24+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-11T06:03:24+00:00 |
| URL | https://worldlyinfo.com/robots.txt |
| Domain IPs | 2a02:4780:2b:2099:0:341d:aae:3, 82.25.83.186 |
| Response IP | 82.25.83.186 |
| Found | Yes |
| Hash | 1af5bd1da9c8d01b4a465be32831fdc91b7e7963f3b0bd431a99803276260af5 |
| SimHash | 772fd951c0a6 |
| SimHash | 772fd951c0a6 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Disallow | /wp-admin/admin-ajax.php |
| Disallow | /wp-content/uploads/wpforms/ |
| Disallow | /wp-includes/ |
| Disallow | /search |
| Disallow | /?s= |
| Disallow | /?page_id= |
| Disallow | /feed/ |
| Disallow | /comments/feed/ |
*
| Rule | Path |
|---|---|
| Disallow | /*?_bc_fsnf=1* |
| Disallow | /*%26_bc_fsnf%3D1* |
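The two `*` groups above can be read as a single rule set, since RFC 9309 says groups matching the same user agent are combined. The following is a minimal sketch of how a crawler might evaluate those Disallow paths. It assumes `*` matches any run of characters and a trailing `$` anchors the end of the path. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and are not part of this report or of any library. Because neither group lists an Allow rule, longest-match precedence is omitted, and percent-encoding normalization is not handled.

```python
import re

# Disallow paths copied from the two "*" groups in this report.
DISALLOW = [
    "/wp-admin/",
    "/wp-admin/admin-ajax.php",
    "/wp-content/uploads/wpforms/",
    "/wp-includes/",
    "/search",
    "/?s=",
    "/?page_id=",
    "/feed/",
    "/comments/feed/",
    "/*?_bc_fsnf=1*",
    "/*%26_bc_fsnf%3D1*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the beginning of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if any Disallow pattern matches the path (plus query string)."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)
```

Under these assumptions, `is_disallowed("/wp-admin/options.php")` and `is_disallowed("/shop/?_bc_fsnf=1&color=red")` return `True`, while `is_disallowed("/about/")` returns `False`.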
ai2bot
ai2bot-dolma
amazonbot
applebot
applebot-extended
bytespider
ccbot
chatgpt-user
claude-web
claudebot
diffbot
facebookbot
friendlycrawler
gptbot
google-extended
googleother
googleother-image
googleother-video
icc-crawler
isscyberriskcrawler
imagesiftbot
kangaroo bot
meta-externalagent
meta-externalfetcher
oai-searchbot
perplexitybot
petalbot
scrapy
sidetrade indexer bot
timpibot
velenpublicwebcrawler
webzio-extended
youbot
anthropic-ai
cohere-ai
facebookexternalhit
iaskspider/2.0
img2dataset
omgili
omgilibot
No rules defined. All paths allowed.
Other Records
| Field | Value |
|---|---|
| crawl-delay | 10 |
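A rough sketch of how a crawler could honor this record follows. It assumes the `crawl-delay` record belongs to the group of named agents listed above it, and that the value is in seconds, which is the common interpretation. `crawl-delay` is not part of RFC 9309, so support varies between crawlers. The names `DELAYED_AGENTS`, `delay_for`, and `seconds_to_wait` are hypothetical, and only a few agent tokens from the list are included for brevity.

```python
# Assumption: the value 10 is a delay in seconds between successive requests.
CRAWL_DELAY_SECONDS = 10.0

# A few agent tokens taken from the group above; the full list is longer.
DELAYED_AGENTS = {"gptbot", "claudebot", "ccbot", "perplexitybot"}


def delay_for(user_agent):
    """Return the delay in seconds for an agent token, or 0.0 if it is not in the group."""
    return CRAWL_DELAY_SECONDS if user_agent.lower() in DELAYED_AGENTS else 0.0


def seconds_to_wait(user_agent, last_request_at, now):
    """Return how long to wait before the next request.

    Both timestamps are in seconds on the same clock, for example time.monotonic().
    """
    elapsed = now - last_request_at
    return max(0.0, delay_for(user_agent) - elapsed)
```

For example, `seconds_to_wait("claudebot", 100.0, 104.0)` gives `6.0`, and an agent outside the group gets `0.0`.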
Other Records
| Field | Value |
|---|---|
| sitemap | https://worldlyinfo.com/sitemap_index.xml |