馒头and花卷

November 17, 2024

Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

Summary: Contents: Overview; UoT; Code. Hu Z., Liu C., Feng X., Zhao Y., Ng S., Luu A. T., He J., Koh P. W. and Hooi B. Uncertainty of thoughts: Uncertainty-aware planning enhances ...

posted @ 2024-11-17 14:41 馒头and花卷

November 12, 2024

Recursive Algorithm for Sliding Signal Processing

Summary: Contents: Overview; Fast algorithms over sliding windows. Farhang-Boroujeny B. and Gazor S. Generalized sliding FFT and its application to implementation of block LMS adaptive filters. TSP, 1994 ...

posted @ 2024-11-12 21:20 馒头and花卷

November 6, 2024

Frequent Directions

Summary: Contents: Overview; Frequent Directions; Frequent Directions over Sliding Windows; Code. Ghashami M., Liberty E., Phillips J. M. and Woodruff D. P. Frequent directions: Sim ...

posted @ 2024-11-06 15:47 馒头and花卷

October 31, 2024

Faster Local Solvers for Graph Diffusion Equations

Summary: Contents: Overview; Traditional approximate solvers for Graph Diffusion Equations; Sequential local updates via Successive Overrelaxation (SOR); Code. Bai J., Zhou B., Yang D. and Xiao Y. Faster Local S ...

posted @ 2024-10-31 17:27 馒头and花卷

October 23, 2024

STAR: A Simple Training-free Approach for Recommendations using Large Language Models

Summary: Contents: Overview; Notation; STAR; Retrieval; Ranking; Final results. Lee D., Kraft A., Jin L., Mehta N., Xu T., Hong L., Chi E. H. and Yi X. STAR: A simple training-free approach for rec ...

posted @ 2024-10-23 11:03 馒头and花卷

October 16, 2024

Are Graph Augmentations Necessary? Simple Graph Contrastive Learning for Recommendation

Summary: Contents: Overview; Graph CL; SimGCL; Code. Yu J., Yin H., Xia X., Chen T., Cui L. and Huang N. Q. V. Are graph augmentations necessary? Simple graph contrastive learning for ...

posted @ 2024-10-16 14:03 馒头and花卷

October 13, 2024

Unifying Graph Convolution and Contrastive Learning in Collaborative Filtering

Summary: Contents: Overview; Main content; Code. Wu Y., Zhang L., Mo F., Zhu T., Ma W. and Nie J. Unifying graph convolution and contrastive learning in collaborative filtering. KDD, 2024. ...

posted @ 2024-10-13 20:59 馒头and花卷

Contrastive Learning Is Spectral Clustering On Similarity Graph

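Summary: Contents: Overview; Main content; Original code. Tan Z., Zhang Y., Yang J. and Yuan Y. Contrastive learning is spectral clustering on similarity graph. ICLR, 2024. Overview: This paper connects contrastive learning with spectral clustering. Main ...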

posted @ 2024-10-13 17:07 馒头and花卷

October 11, 2024

Auxiliary Learning by Implicit Differentiation

Summary: Contents: Overview; AuxiLearn; Problem setup; Understanding the two-stage training; Code. Navon A., Achituve I., Maron H., Chechik G. and Fetaya E. Auxiliary learning by implicit differentiation. ICLR, 2021. Overview: Through ...

posted @ 2024-10-11 16:34 馒头and花卷

October 6, 2024

Long-Sequence Recommendation Models Need Decoupled Embeddings

Summary: Contents: Overview; Decoupled Attention and Representation Embeddings (DARE) model. Feng N., Pang J., Wu J., Chen B., Wang X., Li Q., Hu X., Jiang J. and Long M. Long-s ...

posted @ 2024-10-06 10:29 馒头and花卷

September 20, 2024

Modularity-based Graph Clustering

Summary: Contents: Overview; Notation; Modularity; Agglomerative Hierarchical Clustering; Louvain; Modularity-based Graph Clustering; Rabbit; Code. [1] Newman M. E. J. and Girvan M. Finding and ev ...

posted @ 2024-09-20 15:02 馒头and花卷

September 11, 2024

Adafactor: Adaptive Learning Rates with Sublinear Memory Cost

Summary: Contents: Overview; Notation; Adafactor; Factored Second Moment Estimation; No Momentum; Out-of-Date Second Moment Estimator; Algorithm; Code. Shazeer N. and Stern M. Adafactor: Adaptive learni ...

posted @ 2024-09-11 15:28 馒头and花卷

September 10, 2024

Memory-Efficient Adaptive Optimization

Summary: Contents: Overview; Notation; SM3; Partitioning of intervals; Code. Anil R., Gupta V., Koren T., Singer Y. Memory-efficient adaptive optimization. NeurIPS, 2019. Overview: This paper proposes a memory-efficient optimizer: SM3. ...

posted @ 2024-09-10 21:25 馒头and花卷

September 8, 2024

A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs

Summary: Contents: Overview; METIS; Coarsening; Partitioning phase; Uncoarsening phase. Karypis G. and Kumar V. A fast and high quality multilevel scheme for partitioning irregular gr ...

posted @ 2024-09-08 17:21 馒头and花卷

Graph Edge Partitioning via Neighborhood Heuristic

Summary: Contents: Overview; Notation; Vertex vs Edge partitioning; NE (Neighbor Expansion); Code. Zhang C., Wei F., Liu Q., Tang Z. G. and Li Z. Graph edge partitioning via neighborhood he ...

posted @ 2024-09-08 14:17 馒头and花卷

August 29, 2024

DCN V2: Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems

Summary: Contents: Overview; DCN-v2. Wang R., Shivanna R., Cheng D. Z., Jain S., Lin D., Hong L. and Chi E. D. DCN V2: Improved deep & cross network and practical lessons for we ...

posted @ 2024-08-29 11:19 馒头and花卷

August 28, 2024

Adam-mini: Use Fewer Learning Rates To Gain More

Summary: Contents: Overview; Adam-mini; Code. Zhang Y., Chen C., Li Z., Ding T., Wu C., Ye Y., Luo Z. and Sun R. Adam-mini: Use fewer learning rates to gain more. arXiv preprint, 20 ...

posted @ 2024-08-28 15:58 馒头and花卷

August 27, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Summary: Contents: Overview; Notation; GaLore. Zhao J., Zhang Z., Chen B., Wang Z., Anandkumar A. and Tian Y. GaLore: Memory-efficient LLM training by gradient low-rank projection. IC ...

posted @ 2024-08-27 16:05 馒头and花卷

BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models

Summary: Contents: Overview; BAdam; Code. Luo Q., Yu H. and Li X. BAdam: A memory efficient full parameter optimization method for large language models. arXiv preprint, 2024. Overview: This paper ...

posted @ 2024-08-27 10:12 馒头and花卷

August 26, 2024

Why Transformers Need Adam: A Hessian Perspective

Summary: Contents: Overview; Notation; Hessian matrix of all parameters; Block-wise Hessian; Code. Zhang Y., Chen C., Ding T., Li Z., Sun R. and Luo Z. Why transformers need Adam: a Hessian perspective. ar ...

posted @ 2024-08-26 17:13 馒头and花卷
