it-blogger.net
robots.txt

Robots Exclusion Standard data for it-blogger.net

Resource Scan

Scan Details

Site Domain it-blogger.net
Base Domain it-blogger.net
Scan Status Ok
Last Scan2025-12-07T04:44:44+00:00
Next Scan 2025-12-14T04:44:44+00:00

Last Scan

Scanned2025-12-07T04:44:44+00:00
URL https://it-blogger.net/robots.txt
Domain IPs 85.13.140.44
Response IP 85.13.140.44
Found Yes
Hash 3e19b013906f0e1cb644b2e6d95645fffcdc5e00a6ed3a62fcfdb895e1912a19
SimHash 9f6f68019db4

Groups

*

Rule Path
Allow /

anthropicbot
barkrowler
blexbot
ccbot
claude
claudebot
cazoodlebot
dotbot/1.0
dataforseobot
geedobot
gigabot
go-http-client
imagesiftbot
petalbot
riddler
nuclei
node/simplecrawler
mj12bot
magpie-crawler
repolookoutbot
serpstatbot
seokicks
wikido
zoominfobot
meta-externalagent

Rule Path
Disallow /

Other Records

Field Value
sitemap https://it-blogger.net/sitemap.xml
sitemap https://it-blogger.net/news-sitemap.xml
sitemap https://it-blogger.net/sitemap_index.xml
sitemap https://it-blogger.net/sitemap-index-1.xml

Comments

  • robots.txt for https://it-blogger.net/
  • News: https://it-blogger.net/news-sitemap.xml
  • Ban bots that don't benefit us.
  • User-agent: GPTBot
  • User-agent: ChatGPT-User
  • User-Agent: FacebookBot
  • Disallow: /