Abstract: The proliferation of machine-learning workloads has accelerated the demand for higher memory bandwidth in modern systems. HBM DRAM was developed to break through the system-performance limit ...
Model: BGE-base-en-v1.5 with quantized HNSW indexes (using ONNX for on-the-fly query encoding) This page describes regression experiments, integrated into Anserini's regression testing framework, ...