smartairfilters.com
robots.txt

Robots Exclusion Standard data for smartairfilters.com

Resource Scan

Scan Details

Site Domain smartairfilters.com
Base Domain smartairfilters.com
Scan Status Ok
Last Scan 2024-05-28T09:46:16+00:00
Next Scan 2024-06-27T09:46:16+00:00

Last Scan

Scanned 2024-05-28T09:46:16+00:00
URL https://smartairfilters.com/robots.txt
Domain IPs 104.26.4.208, 104.26.5.208, 172.67.69.49, 2606:4700:20::681a:4d0, 2606:4700:20::681a:5d0, 2606:4700:20::ac43:4531
Response IP 104.26.5.208
Found Yes
Hash deb47333002c129381ee14007843a0c0ac1b97cfa8848daeac42f4019add8bdf
SimHash 8a19d84a8b92

Groups

*

Rule Path
Disallow /mn_old/*
Disallow /daily-pollution/*
Disallow /redirect/*
Disallow /template/*

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /en/blog/*

gptbot

Rule Path
Disallow /en/blog/*

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /
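The groups above can be checked against a crawler and path with a short Python sketch. It is an illustration only, not the scanner's or any crawler's actual logic. The names GROUPS, pattern_to_regex, and is_allowed are hypothetical. It assumes the caller passes the bare product token (for example "GPTBot") rather than a full User-Agent header, that a crawler with its own group ignores the * group, and that * in a path matches any run of characters. A hand-written matcher is used because Python's urllib.robotparser does simple prefix matching and does not interpret * in paths.

```python
import re

# Groups as listed in the scan above: user-agent token -> Disallow paths.
GROUPS = {
    "*": ["/mn_old/*", "/daily-pollution/*", "/redirect/*", "/template/*"],
    "ccbot": ["/"],
    "chatgpt-user": ["/en/blog/*"],
    "gptbot": ["/en/blog/*"],
    "google-extended": ["/"],
    "anthropic-ai": ["/"],
    "omgilibot": ["/"],
    "omgili": ["/"],
    "facebookbot": ["/"],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$' the pattern behaves as a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if no Disallow rule in the selected group matches path.

    Assumption: user_agent is a bare product token. A token with its own
    group uses only that group; every other token falls back to '*'.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return not any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("GPTBot", "/en/blog/some-post"),
        ("GPTBot", "/mn_old/page"),
        ("CCBot", "/"),
        ("Googlebot", "/daily-pollution/beijing"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Because the file contains only Disallow rules, the sketch does not need the longest-match tie-breaking between Allow and Disallow that a complete implementation would require.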

Other Records

Field Value
sitemap https://smartairfilters.com/en/sitemap_index.xml
sitemap https://smartairfilters.com/zh/sitemap_index.xml
sitemap https://smartairfilters.com/cn/en/sitemap_index.xml
sitemap https://smartairfilters.com/cn/zh/sitemap_index.xml
sitemap https://smartairfilters.com/in/en/sitemap_index.xml
sitemap https://smartairfilters.com/mn/en/sitemap_index.xml
sitemap https://smartairfilters.com/mn/mn/sitemap_index.xml
sitemap https://smartairfilters.com/ph/en/sitemap_index.xml
sitemap https://smartairfilters.com/bd/en/sitemap_index.xml
sitemap https://smartairfilters.com/th/en/sitemap_index.xml
sitemap https://smartairfilters.com/learn/sitemap_index.xml
sitemap https://smartairfilters.com/xuexi/sitemap_index.xml
sitemap https://smartairfilters.com/sitemaps/hreflang.xml
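Sitemap records like these can be read from robots.txt text with Python's standard library. The sketch below is a minimal illustration that assumes Python 3.8 or later (for site_maps()). The SAMPLE_LINES list is a hypothetical excerpt built from three of the URLs above, not the full file, and the "Sitemap:" line form is the usual robots.txt syntax rather than text copied from the scanned file.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt: three of the sitemap records listed above,
# written in the usual "Sitemap: <url>" form.
SAMPLE_LINES = [
    "Sitemap: https://smartairfilters.com/en/sitemap_index.xml",
    "Sitemap: https://smartairfilters.com/zh/sitemap_index.xml",
    "Sitemap: https://smartairfilters.com/sitemaps/hreflang.xml",
]

parser = RobotFileParser()
parser.parse(SAMPLE_LINES)  # parse() accepts an iterable of lines

# site_maps() returns the listed URLs in order, or None if there are none.
sitemaps = parser.site_maps()

if __name__ == "__main__":
    for url in sitemaps or []:
        print(url)
```

Sitemap records are independent of the user-agent groups, so the parser returns them whichever crawler is asking.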