Tap any word above to look it up or add it to your review deck
This pattern shows that one situation changes along with another. It is often used to describe trends or developments over time. In the text, it means that when the context window becomes larger, the cache increasingly becomes a major memory bottleneck.
"随着上下文窗口变大,这些缓存正成为主要的内存瓶颈。"
This structure gives the condition under which something happens. When used with negative words like “无需”, it means something can be done without needing a certain action first.
"TurboQuant可在无需重新训练或微调模型的情况下,将键值缓存压缩至3bit精度,同时基本保持模型准确率不受影响。"
“将” is used to mark the object before the verb, especially in formal or technical writing. Here it introduces the thing being changed, and “至” shows the final result or degree reached.
"TurboQuant可在无需重新训练或微调模型的情况下,将键值缓存压缩至3bit精度,同时基本保持模型准确率不受影响。"
“旨在” is a formal written expression meaning “to aim at” or “to be intended to.” It is commonly used in news, academic, and technical Chinese to state purpose.
"TurboQuant压缩技术旨在降低大语言模型和向量搜索引擎的内存占用。"
This is a common formal reporting structure. “对……的测试” means “tests on...”, and “显示” introduces the result or conclusion. It is useful in scientific and news writing.
"对包括Gemma等开源模型的测试显示,该技术可实现约6倍的键值缓存内存压缩效果。"
(bound form) bull's-eye; target
random access memory (RAM)
matrix
to compress; compression
to reduce; to lower; to bring down
system
probably
used in 可汗[ke4 han2]
technology; technique; skill
main; principal; major; primary
problem that impedes progress
precision
to have as its purpose
to use in; to use on; to use for
to maintain
language
artificial intelligence (AI)
Log in to leave a comment.
Loading comments...