MarkTechPost

AI Research News

AI Research Videos

Researchers from Cerebras & Neural Magic Introduce Sparse Llama: The First Production LLM based on Llama at 70% Sparsity

3 days ago

By Asif Razzaq

FastGen: Cutting GPU Memory Costs Without Compromising on LLM Quality

8 days ago

Google AI Described New Machine Learning Methods for Generating Differentially Private Synthetic Data

Yesterday

By Pragati Jhunjhunwala

Researchers from Columbia University and Databricks Conducted a Comparative Study of LoRA and Full Finetuning in Large Language Models

Yesterday

By Asif Razzaq

Researchers from UC Berkeley, UIUC, and NYU Developed an Algorithmic Framework that Uses Reinforcement Learning (RL) to Optimize Vision-Language Models (VLMs)

1 hour ago

By Tanya Malhotra

Toward Responsible Innovation: Evaluating Risks and Opportunities in Open Generative AI

5 hours ago

By Mohammad Asjad

Bisheng: An Open-Source LLM DevOps Platform Revolutionizing LLM Application Development

11 hours ago

By Niharika Singh

TRANSMI: A Machine Learning Framework to Create Baseline Models Adapted for Transliterated Data from Existing Multilingual Pretrained Language Models (mPLMs) without Any Training

17 hours ago

Meet Verba 1.0: Run State-of-the-Art RAG Locally with Ollama Integration and Open Source Models

14 hours ago

By Asif Razzaq

Top AI Tools for Real Estate Agents

3 days ago

By Dhanshree Shripad Shenwai

The Pursuit of the Platonic Representation: AI’s Quest for a Unified Model of Reality

2 days ago

By Vineet Kumar

OpenAI Released GPT-4o for Enhanced Interactivity and Many Free Tools for ChatGPT Free Users

7 days ago

By Asif Razzaq

UC Berkeley Researchers Introduce Learnable Latent Codes as Bridges (LCB): A Novel AI Approach that Combines the Abstract Reasoning Capabilities of Large Language Models with Low-Level Action Policies

8 days ago

By Mohammad Asjad

Machine Learning Revolutionizes Path Loss Modeling with Simplified Features

Yesterday

By Vineet Kumar

Excited about GPT-4o? Now Check out Google AI’s New Project ‘Astra’: The Multimodal Answer to the New ChatGPT

5 days ago

ChuXin: A Fully Open-Sourced Language Model with a Size of 1.6 Billion Parameters

9 days ago

By Dhanshree Shripad Shenwai

DataSP: A Differentiable All-to-All Shortest Path Machine Learning Algorithm to Facilitate Learning Latent Costs from Trajectories

5 days ago

By Mohammad Arshad

How ‘Chain of Thought’ Makes Transformers Smarter

8 days ago

By Vineet Kumar

Microsoft Researchers Introduce Syntheseus: A Machine Learning Benchmarking Python Library for End-to-End Retrosynthetic Planning

6 days ago

By Dhanshree Shripad Shenwai

Researchers from Princeton and Meta AI Introduce ‘Lory’: A Fully-Differentiable MoE Model Designed for Autoregressive Language Model Pre-Training

8 days ago

Innovating Game Design with GPT: A Comprehensive Scoping Review

9 days ago

THRONE: Advancing the Evaluation of Hallucinations in Vision-Language Models

8 days ago

By Sana Hassan

Aloe: A Family of Fine-tuned Open Healthcare LLMs that Achieves State-of-the-Art Results through Model Merging and Prompting Strategies

9 days ago

By Asif Razzaq

Tsinghua University Researchers Propose ADELIE: Enhancing Information Extraction with Aligned Large Language Models Around Human-Centric Tasks

8 days ago

By Sana Hassan

Safe Marine Navigation Using Vision AI: Enhancing Maritime Safety and Efficiency

8 days ago

xLSTM: Enhancing Long Short-Term Memory (LSTM) Capabilities for Advanced Language Modeling and Beyond

10 days ago

By Sana Hassan

Sparse-Matrix Factorization-based Method: Efficient Computation of Latent Query and Item Representations to Approximate CE Scores

11 days ago

What Are The Dimensions For Creating Retrieval Augmented Generation (RAG) Pipelines?

12 days ago

By Tanya Malhotra

Meet Pyte: A Data Collaboration Platform that Preserves the Confidentiality of Data During Its Entire Data Lifecycle

18 days ago

By Dhanshree Shripad Shenwai

Deciphering Transformer Language Models: Advances in Interpretability Research

15 days ago

By Mohammad Asjad

Researchers at UC Berkeley Unveil a Novel Interpretation of the U-Net Architecture Through the Lens of Generative Hierarchical Models

19 days ago

BiomedRAG: Elevating Biomedical Data Analysis with Retrieval-Augmented Generation in Large Language Models

13 days ago

By Sana Hassan

Top Artificial Intelligence (AI) Governance Laws and Frameworks

18 days ago

By Tanya Malhotra

Top AI-Powered Cartoonizer Tools

11 days ago

By Mohammad Asjad

Huawei AI Introduces ‘Kangaroo’: A Novel Self-Speculative Decoding Framework Tailored for Accelerating the Inference of Large Language Models

18 days ago

By Sana Hassan

Gradformer: A Machine Learning Method that Integrates Graph Transformers (GTs) with the Intrinsic Inductive Bias by Applying an Exponential Decay Mask to the Attention Matrix

Apr 30

Iterative Preference Optimization for Improving Reasoning Tasks in Language Models

18 days ago

By Mohammad Asjad

Researchers from Stanford and Amazon Developed STARK: A Large-Scale Semi-Structured Retrieval AI Benchmark on Textual and Relational Knowledge Bases

19 days ago

By Vineet Kumar

A Comparative Analysis: Humans and AI Across Different Tasks

19 days ago

By Sana Hassan

This AI Paper from Apple Introduces a Weakly-Supervised Pre-Training Method for Vision Models Using Publicly Available Web-Scale Image-Text Data

Apr 29

By Tanya Malhotra

Top Data Science Courses in 2024

Apr 29

By Shobha Kakkar

This AI Paper from China Introduces TinyChart: An Efficient Multimodal Large Language Model (MLLM) for Chart Understanding with Only 3B Parameters

Apr 30

By Mohammad Arshad

REBEL: A Reinforcement Learning (RL) Algorithm that Reduces the Problem of RL to Solving a Sequence of Relative Reward Regression Problems on Iteratively Collected Datasets

Apr 30

By Mohammad Asjad

Mistral.rs: A Lightning-Fast LLM Inference Platform with Device Support, Quantization, and an OpenAI API-Compatible HTTP Server and Python Bindings

Apr 28

By Niharika Singh

OpenVoice V2: Evolving Multilingual Voice Cloning with Enhanced Style Control and Cross-Lingual Capabilities

Apr 29

By Sana Hassan
