jumia.gw
robots.txt
Robots Exclusion Standard data for jumia.gw
Resource Scan
Scan Details
Site Domain | jumia.gw |
Base Domain | jumia.gw |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-08-23T07:32:28+00:00 |
Next Scan | 2024-11-21T07:32:28+00:00 |
Last Successful Scan
Scanned | 2024-01-27T05:01:16+00:00 |
URL | https://www.jumia.gw/robots.txt |
Domain IPs | 104.18.235.23, 104.18.236.23, 2606:4700::6812:eb17, 2606:4700::6812:ec17 |
Response IP | 104.18.235.23 |
Found | Yes |
Hash | 314a45f99a5351fd65c4230c83589a5550cac2793228d3e598c64423e7e88697 |
SimHash | f0004840c951 |
Groups
*
Rule | Path |
---|---|
Disallow | /*?*sortBy=* |
Disallow | /*?*sortOrder=* |
Disallow | /*?*catalogView=* |
Disallow | /*favorites* |
Disallow | /*?*continue=* |
Disallow | /*posts/ |
Disallow | /*vip-ads? |
Disallow | /*bump? |
Disallow | /*paid-post? |
Disallow | /pwa/* |
Disallow | /*137010309* |
Allow | /*posts/new |
Allow | *.css |
Allow | *.js |
Other Records
Field | Value |
---|---|
sitemap | https://www.jumia.gw/sitemap.xml |