malgusto.com
robots.txt

Robots Exclusion Standard data for malgusto.com

Resource Scan

Scan Details

Site Domain malgusto.com
Base Domain malgusto.com
Scan Status Ok
Last Scan2024-06-28T14:33:19+00:00
Next Scan 2024-07-05T14:33:19+00:00

Last Scan

Scanned2024-06-28T14:33:19+00:00
URL https://malgusto.com/robots.txt
Domain IPs 185.162.171.100
Response IP 185.162.171.100
Found Yes
Hash a9eaea122af6a58e9bf720a51f90be8b735f7d6fabbc5c3422306794764ad917
SimHash e865f88082a2

Groups

*

Rule Path
Allow /*.js$
Allow /*.css$
Disallow /*?*
Disallow /*?
Disallow */feed
Disallow */feed/
Disallow /feed
Disallow /feed/
Disallow /comments/feed
Disallow /feed/$
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$
Disallow /wp-includes/
Disallow /wp-admin/

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

noxtrumbotcrawl-delay: 50
msnbotcrawl-delay: 30
slurpcrawl-delay: 10

Rule Path
Disallow *?replytocom

Other Records

Field Value
sitemap https://www.malgusto.com/sitemap_index.xml