cdn.scite.ai
robots.txt

Robots Exclusion Standard data for cdn.scite.ai

Resource Scan

Scan Details

Site Domain cdn.scite.ai
Base Domain scite.ai
Scan Status Ok
Last Scan 2024-06-13T16:16:24+00:00
Next Scan 2024-07-13T16:16:24+00:00

Last Scan

Scanned 2024-06-13T16:16:24+00:00
URL https://cdn.scite.ai/robots.txt
Domain IPs 108.156.133.112, 108.156.133.25, 108.156.133.71, 108.156.133.89, 2600:9000:2755:2400:1:f313:9b00:93a1, 2600:9000:2755:7400:1:f313:9b00:93a1, 2600:9000:2755:8a00:1:f313:9b00:93a1, 2600:9000:2755:9000:1:f313:9b00:93a1, 2600:9000:2755:9600:1:f313:9b00:93a1, 2600:9000:2755:9e00:1:f313:9b00:93a1, 2600:9000:2755:b600:1:f313:9b00:93a1, 2600:9000:2755:fa00:1:f313:9b00:93a1
Response IP 108.156.133.25
Found Yes
Hash f2045d93db6c6183cb158f559cf61b6169403ac3db508dbab4e6c93c5e393f94
SimHash 01500c574714

Groups

*

Rule Path
Allow *
Disallow /reference-check/
Disallow /visualizations/
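
The rules above can look contradictory at first, since "Allow *" matches every path. The following is a minimal sketch of how a crawler following the longest-match convention of RFC 9309 might resolve them: the matching rule with the longest pattern wins, and Allow wins a tie. The names RULES, pattern_to_regex and is_allowed are hypothetical and are not part of the scan report. Whether a given crawler accepts a bare "*" as an Allow path is also an assumption, and some parsers may treat it differently.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "*"),
    ("disallow", "/reference-check/"),
    ("disallow", "/visualizations/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match semantics."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and is_allow
            if longer or tie_for_allow:
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for candidate in ("/sitemap.xml", "/reference-check/abc", "/visualizations/"):
        print(candidate, is_allowed(candidate))
```

Under these assumptions, paths beneath /reference-check/ and /visualizations/ are disallowed because those patterns are longer than "*", and every other path falls through to the Allow rule.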

Other Records

Field Value
sitemap https://cdn.scite.ai/sitemap.xml
sitemap https://cdn.scite.ai/sitemap-journals.xml
sitemap https://cdn.scite.ai/sitemap-funding-institutions.xml
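
For readers who want to work with these records programmatically, here is a small sketch that parses a robots.txt body into rule groups and sitemap URLs. The ROBOTS_TXT string is a reconstruction from the tables in this report, not the verbatim file served at the URL above, so its exact spelling, ordering and whitespace are assumptions. parse_robots is a hypothetical helper that handles only the fields shown here.

```python
# Reconstructed from the report's tables; the real file may differ in detail.
ROBOTS_TXT = """\
User-agent: *
Allow: *
Disallow: /reference-check/
Disallow: /visualizations/

Sitemap: https://cdn.scite.ai/sitemap.xml
Sitemap: https://cdn.scite.ai/sitemap-journals.xml
Sitemap: https://cdn.scite.ai/sitemap-funding-institutions.xml
"""


def parse_robots(text):
    """Return (groups, sitemaps) for a robots.txt body.

    groups maps each user-agent token to a list of (rule, path) tuples;
    sitemaps is a list of URLs in file order.
    """
    groups = {}
    sitemaps = []
    current_agents = []
    last_was_agent = False
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = line.split(":", 1)
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            # Consecutive user-agent lines share one group.
            if not last_was_agent:
                current_agents = []
            current_agents.append(value)
            groups.setdefault(value, [])
            last_was_agent = True
        elif field in ("allow", "disallow"):
            for agent in current_agents:
                groups[agent].append((field, value))
            last_was_agent = False
        elif field == "sitemap":
            # Sitemap records are independent of any group.
            sitemaps.append(value)
    return groups, sitemaps


if __name__ == "__main__":
    groups, sitemaps = parse_robots(ROBOTS_TXT)
    print(groups)
    print(sitemaps)
```

Splitting on the first colon only keeps the "https://" in each sitemap URL intact.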