maspero.eg
robots.txt
Robots Exclusion Standard data for maspero.eg
Resource Scan
Scan Details
Site Domain | maspero.eg |
Base Domain | maspero.eg |
Scan Status | Ok |
Last Scan | 2024-11-16T09:02:25+00:00 |
Next Scan | 2024-11-23T09:02:25+00:00 |
Last Scan
Scanned | 2024-11-16T09:02:25+00:00 |
URL | https://www.maspero.eg/robots.txt |
Domain IPs | 13.107.246.59, 2620:1ec:bdf::59 |
Response IP | 13.107.246.59 |
Found | Yes |
Hash | e1bd683f39c9790b647fc40ad4b806b9be4335054432c6f622ba673b576518ab |
SimHash | c60ddc716703 |
Groups
*
Rule | Path |
---|---|
Disallow | /wps/ |
Disallow | /wpurl/ |
Disallow | /search/ |
Disallow | /search?q= |
Disallow | /search?q=* |
awariorssbot
awariosmartbot
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 10 |
*
Rule | Path |
---|---|
Disallow | / |
amazonbot
nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler
weborama-fetcher
yandexbot
surdotlybot/1.0
trendictionbot
fidget-spinner-bot
mj12bot/v1.4.8
paqlebot/2.0
buck/2.3.2
dataofrseobot/1.0
criteobot/0.1
my-tiny-bot
nimbostratus-bot/v1.3.2
peer39_crawler/1.0
twingly recon-klondike/1.0
admantx-ussy04/3.2
gnowitnewsbot
openindexspider
gptbot
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.maspero.eg/sitemap_index.xml |
Comments