Robots Exclusion Standard data for internetmarketingdeals.org

Resource Scan

Scan Details

Site Domain internetmarketingdeals.org
Base Domain internetmarketingdeals.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 5/6/2025, 7:42:29 AM
Next Scan 8/4/2025, 7:42:29 AM

Last Successful Scan

Scanned 1/12/2024, 7:21:39 PM
URL https://internetmarketingdeals.org/robots.txt
Domain IPs 104.21.59.158, 172.67.180.139, 2606:4700:3031::ac43:b48b, 2606:4700:3034::6815:3b9e
Response IP 104.21.59.158
Found Yes
Hash c61f46ac4c20a1b7a05819bb7d94bf139e16067df3d44cf42a8d95a3d736892f
SimHash 68585912c4e1

Groups

googlebot

Rule Path
Allow /sitemap.xml
Allow /sitemap.xml.gz

*

Rule Path
Disallow /cgi-bin/
Disallow /dl/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Disallow /cgi-bin/
Disallow /about
Disallow /contact
Disallow /wp-
Disallow /feed/
Disallow /rss/
Disallow /iframes/
Disallow /carp_evolution_4/
Disallow /scripts/
Disallow /go/
Disallow /likes/
Disallow /recommends/
Disallow /trackback
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.gz$
Disallow /*.cgi$
Disallow /*.wmv$
Disallow /*.png$
Disallow /*.gif$
Disallow /*.jpg$
Disallow /*.cgi$
Disallow /*.xhtml$
Disallow /*.php*
Disallow /wp-*
Allow /wp-content/uploads/
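
The final Allow rule overlaps several of the Disallow rules in this group, so the outcome depends on how a crawler resolves conflicts. The sketch below is a minimal illustration, assuming the longest-match precedence described in RFC 9309 (the most specific pattern wins, and Allow wins a tie). The names RULES, pattern_to_regex and is_allowed are hypothetical, RULES holds only a subset of the rules listed above, and individual crawlers may behave differently.

```python
import re

# Subset of the "*" group rules listed above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/wp-content/"),
    ("disallow", "/wp-"),
    ("disallow", "/*.php$"),
    ("disallow", "/*.png$"),
    ("disallow", "/*.jpg$"),
    ("disallow", "/wp-*"),
    ("allow", "/wp-content/uploads/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, trailing '$' anchor) into a regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where the pattern had '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled, assuming longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt patterns do.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            allow = directive == "allow"
            # A longer pattern wins; on equal length, Allow beats Disallow.
            if length > best_len or (length == best_len and allow and not best_allow):
                best_len, best_allow = length, allow
    return best_allow
```

Under these assumptions, is_allowed("/wp-content/uploads/photo.jpg") returns True because the 20-character Allow pattern outranks the shorter /wp-content/, /wp-* and /*.jpg$ patterns, while is_allowed("/wp-content/themes/style.css") and is_allowed("/index.php") both return False.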

googlebot-image

Rule Path
Allow /*

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /
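
Which of the groups above applies to a given crawler depends on that crawler's matching logic. The sketch below is a simplified illustration, assuming an exact, case-insensitive match on the crawler's product token with "*" as the fallback. GROUPS and select_group are hypothetical names, and real crawlers may use prefix or substring matching instead.

```python
# Group names as listed in the scan above; "*" is the fallback group.
GROUPS = ["googlebot", "*", "googlebot-image", "ia_archiver", "duggmirror"]


def select_group(product_token, groups=GROUPS):
    """Pick the group for a crawler: exact case-insensitive name match, else '*'."""
    token = product_token.lower()
    for name in groups:
        if name != "*" and name.lower() == token:
            return name
    # Fall back to the wildcard group when no named group matches.
    return "*" if "*" in groups else None
```

Under this assumption, select_group("Googlebot") returns "googlebot", a group that contains only Allow rules, so none of the Disallow rules in the "*" group would apply to that crawler. A crawler with no named group, for example select_group("bingbot"), falls back to "*".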

Comments

  • disallow all files in these directories
  • allow Google ImageBot to search all images
  • disallow archiving site
  • disable duggmirror