Researchers from Cerebras & Neural Magic Introduce Sparse Llama: The First Production LLM based on Llama at 70% Sparsity3 days agoAsif RazzaqBy Asif Razzaqmore_vert
Google AI Described New Machine Learning Methods for Generating Differentially Private Synthetic DataYesterdayPragati JhunjhunwalaBy Pragati Jhunjhunwalamore_vert
Researchers from Columbia University and Databricks Conducted a Comparative Study of LoRA and Full Finetuning in Large Language ModelsYesterdayAsif RazzaqBy Asif Razzaqmore_vert
Researchers from UC Berkeley, UIUC, and NYU Developed an Algorithmic Framework that Uses Reinforcement Learning (RL) to Optimize Vision-Language Models (VLMs)1 hour agoTanya MalhotraBy Tanya Malhotramore_vert
Toward Responsible Innovation: Evaluating Risks and Opportunities in Open Generative AI5 hours agoMohammad AsjadBy Mohammad Asjadmore_vert
Bisheng: An Open-Source LLM DevOps Platform Revolutionizing LLM Application Development11 hours agoNiharika SinghBy Niharika Singhmore_vert
TRANSMI: A Machine Learning Framework to Create Baseline Models Adapted for Transliterated Data from Existing Multilingual Pretrained Language Models mPLMs without Any Training17 hours agomore_vert
Meet Verba 1.0: Run State-of-the-Art RAG Locally with Ollama Integration and Open Source Models14 hours agoAsif RazzaqBy Asif Razzaqmore_vert
Top AI Tools for Real Estate Agents3 days agoDhanshree Shripad ShenwaiBy Dhanshree Shripad Shenwaimore_vert
The Pursuit of the Platonic Representation: AI’s Quest for a Unified Model of Reality2 days agoVineet KumarBy Vineet Kumarmore_vert
OpenAI Released GPT-4o for Enhanced Interactivity and Many Free Tools for ChatGPT Free Users7 days agoAsif RazzaqBy Asif Razzaqmore_vert
UC Berkeley Researchers Introduce Learnable Latent Codes as Bridges (LCB): A Novel AI Approach that Combines the Abstract Reasoning Capabilities of Large Language Models with Low-Level Action Policies8 days agoMohammad AsjadBy Mohammad Asjadmore_vert
Machine Learning Revolutionizes Path Loss Modeling with Simplified FeaturesYesterdayVineet KumarBy Vineet Kumarmore_vert
Excited about GPT-4o? Now Check out Google AI’s New Project ‘Astra’: The Multimodal Answer to the New ChatGPT5 days agomore_vert
ChuXin: A Fully Open-Sourced Language Model with a Size of 1.6 Billion Parameters9 days agoDhanshree Shripad ShenwaiBy Dhanshree Shripad Shenwaimore_vert
DataSP: A Differentiable All-to-All Shortest Path Machine Learning Algorithm to Facilitate Learning Latent Costs from Trajectories5 days agoMohammad ArshadBy Mohammad Arshadmore_vert
Microsoft Researchers Introduce Syntheseus: A Machine Learning Benchmarking Python Library for End-to-End Retrosynthetic Planning6 days agoDhanshree Shripad ShenwaiBy Dhanshree Shripad Shenwaimore_vert
Researchers from Princeton and Meta AI Introduce ‘Lory’: A Fully-Differentiable MoE Model Designed for Autoregressive Language Model Pre-Training8 days agomore_vert
THRONE: Advancing the Evaluation of Hallucinations in Vision-Language Models8 days agoSana HassanBy Sana Hassanmore_vert
Aloe: A Family of Fine-tuned Open Healthcare LLMs that Achieves State-of-the-Art Results through Model Merging and Prompting Strategies9 days agoAsif RazzaqBy Asif Razzaqmore_vert
Tsinghua University Researchers Propose ADELIE: Enhancing Information Extraction with Aligned Large Language Models Around Human-Centric Tasks8 days agoSana HassanBy Sana Hassanmore_vert
xLSTM: Enhancing Long Short-Term Memory LSTM Capabilities for Advanced Language Modeling and Beyond10 days agoSana HassanBy Sana Hassanmore_vert
Sparse-Matrix Factorization-based Method: Efficient Computation of Latent Query and Item Representations to Approximate CE Scores11 days agomore_vert
What Are The Dimensions For Creating Retrieval Augmented Generation (RAG) Pipelines?12 days agoTanya MalhotraBy Tanya Malhotramore_vert
Meet Pyte: A Data Collaboration Platform that Preserves the Confidentiality of Data During Its Entire Data Lifecycle18 days agoDhanshree Shripad ShenwaiBy Dhanshree Shripad Shenwaimore_vert
Deciphering Transformer Language Models: Advances in Interpretability Research15 days agoMohammad AsjadBy Mohammad Asjadmore_vert
Researchers at UC Berkeley Unveil a Novel Interpretation of the U-Net Architecture Through the Lens of Generative Hierarchical Models19 days agomore_vert
BiomedRAG: Elevating Biomedical Data Analysis with Retrieval-Augmented Generation in Large Language Models13 days agoSana HassanBy Sana Hassanmore_vert
Top Artificial Intelligence (AI) Governance Laws and Frameworks18 days agoTanya MalhotraBy Tanya Malhotramore_vert
Huawei AI Introduces ‘Kangaroo’: A Novel Self-Speculative Decoding Framework Tailored for Accelerating the Inference of Large Language Models18 days agoSana HassanBy Sana Hassanmore_vert
Gradformer: A Machine Learning Method that Integrates Graph Transformers (GTs) with the Intrinsic Inductive Bias by Applying an Exponential Decay Mask to the Attention MatrixApr 30more_vert
Iterative Preference Optimization for Improving Reasoning Tasks in Language Models18 days agoMohammad AsjadBy Mohammad Asjadmore_vert
Researchers from Stanford and Amazon Developed STARK: A Large-Scale Semi-Structure Retrieval AI Benchmark on Textual and Relational Knowledge Bases19 days agoVineet KumarBy Vineet Kumarmore_vert
A Comparative Analysis: Humans and AI Across Different Tasks19 days agoSana HassanBy Sana Hassanmore_vert
This AI Paper from Apple Introduces a Weakly-Supervised Pre-Training Method for Vision Models Using Publicly Available Web-Scale Image-Text DataApr 29Tanya MalhotraBy Tanya Malhotramore_vert
This AI Paper from China Introduces TinyChart: An Efficient Multimodal Large Language Models MLLMs for Chart Understanding with Only 3B ParametersApr 30Mohammad ArshadBy Mohammad Arshadmore_vert
REBEL: A Reinforcement Learning RL Algorithm that Reduces the Problem of RL to Solving a Sequence of Relative Reward Regression Problems on Iteratively Collected DatasetsApr 30Mohammad AsjadBy Mohammad Asjadmore_vert
Mistral.rs: A Lightning-Fast LLM Inference Platform with Device Support, Quantization, and Open-AI API Compatible HTTP Server and Python BindingsApr 28Niharika SinghBy Niharika Singhmore_vert
OpenVoice V2: Evolving Multilingual Voice Cloning with Enhanced Style Control and Cross-Lingual CapabilitiesApr 29Sana HassanBy Sana Hassanmore_vert