automat.click
robots.txt

Robots Exclusion Standard data for automat.click

Resource Scan

Scan Details

Site Domain automat.click
Base Domain automat.click
Scan Status Ok
Last Scan2025-10-25T20:25:37+00:00
Next Scan 2025-11-24T20:25:37+00:00

Last Scan

Scanned2025-10-25T20:25:37+00:00
URL https://automat.click/robots.txt
Domain IPs 149.202.77.211, 2001:41d0:d:36d3::1
Response IP 149.202.77.211
Found Yes
Hash d34848f6d25854a4236ccfbf4ca332d31097310e44a9bdcbcc295d43793c65ae
SimHash e6851fe5f2f1

Groups

googlebot
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
google favicon
googlebot-news
googlebot-image
googlebot-video
mediapartners-google
apis-google
duplexweb-google
bingbot
slurp
duckduckbot
baiduspider
ahrefsbot
rogerbot
yandexbot
dotbot
twitterbot
bingpreview
linkedinbot
yandexbot
facebot
facebookexternalhit
msnbot
msnbot-media

Rule Path
Allow /
Allow /matomo.php
Allow /piwik.php
Allow /matomo.js
Allow /piwik.js
Allow /js/
Allow /blog.automat.click/
Allow /fediwall.automat.click/
Allow /peertube-instances.automat.click/
Allow /peertube-search.automat.click/

Other Records

Field Value
sitemap https://blog.automat.click/wp-sitemap-posts-page-1.xml

Comments

  • See http://www.robotstxt.org/orig.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /
  • To ban all spiders from only specific directories such as /people /u or /tag etc.
  • User-Agent: *
  • Disallow: /people/
  • Disallow: /u/
  • Disallow: /camo/
  • Disallow: /
  • Disallow: /people/
  • Disallow: /u/
  • Disallow: /camo/