44118太阳成城集团(中国)有限公司 - 搜狗百科

学术报告:Sampling-based Methods for Inner Product Sketching

发布时间:2024-08-23     浏览量:

报告题目:Sampling-based Methods for Inner Product Sketching

报告时间:202482614:30-15:30

报告地点:44118太阳成城集团B404

报告人:Christopher Musco

报告人国籍:美国

报告人单位:New York University

报告人简介:Christopher Musco is an assistant professor of Computer Science and Engineering at New York University. He is a member of NYU's Theoretical Computer Science group and the Visualization, Data Analysis, and Imaging Center (VIDA Center). His research focuses on the design and theoretical analysis of randomized algorithms with applications in computational mathematics, machine learning, and databases. Professor Musco's work has been funded by the US Department of Energy and National Science Foundation, including through an NSF CAREER Award.

报告摘要I will discuss new "sketching'' methods that can accurately estimate the inner product between any two vectors based on a small compression (or "sketch") of those vectors. Sketches for estimating inner products find applications across databases (for join-size estimation, correlation estimation, and more) and machine learning (for model evaluation, similarity computation, and vector search). Prior state-of-the-art sketching methods for inner products are based on Johnson-Lindenstrauss (JL) random projection and related methods. In contrast, the methods I will discuss are based on an old and powerful technique called "coordinated random sampling". I will prove that our sampling-based sketches enjoy stronger worst-case theoretical guarantees than sketches based on JL projection, and also perform better in practice across a wide variety of applications.

邀请人:王胜