ع
Dashboard

AI Intelligence Dashboard 2025

Interactive analysis of the global and Arab AI scene by AI Workshop

AI Models Database

Explore the leading models. Use filters to customize your search in over 200 models.

GPT-4o

OpenAI

A leading native multimodal model, integrating real-time text, audio, and image understanding for natural and fast human-like interaction.

Real-time ConversationText Generationcommercial
More Details

GPT-4 Turbo

OpenAI

An enhanced version of GPT-4 with a massive context window (128k tokens) and knowledge updated to April 2023.

Text GenerationCode Generationcommercial
More Details

Claude 3.5 Sonnet

Anthropic

A leading model that balances intelligence and speed, excellent for coding, data analysis, and computer vision.

Text GenerationCode Generationcommercial
More Details

Claude 3 Opus

Anthropic

The most powerful model in the Claude 3 family, designed for highly complex tasks requiring near-human intelligence.

Complex ReasoningAdvanced Analysiscommercial
More Details

Claude 3 Haiku

Anthropic

The fastest and smallest model in the Claude 3 family, ideal for instant interactions and customer service.

Live Chat SupportContent Moderationcommercial
More Details

Gemini 1.5 Pro

Google

A multimodal model with a massive context window (1 million tokens), ideal for analyzing huge amounts of data.

Text GenerationVisual Question Answeringcommercial
More Details

Gemini 1.5 Flash

Google

A lightweight and fast model from the Gemini 1.5 family, optimized for tasks requiring quick responses at scale.

SummarizationChat Applicationscommercial
More Details

Gemini 1.0 Ultra

Google

The first model in the Gemini family, designed for highly complex tasks and excelling at understanding nuances.

Advanced ReasoningMultimodal Understandingcommercial
More Details

Command R+

Cohere

A model designed for enterprise applications, focusing on reliable RAG, tool use, and multilingual performance.

Text GenerationRAGcommercial / cc-by-nc-4.0
More Details

Command R

Cohere

The base model of the Command family, balancing performance and cost for enterprise applications.

RAGTool Usecommercial / cc-by-nc-4.0
More Details

Mistral Large

Mistral AI

A leading language model from Mistral AI, featuring strong reasoning capabilities and high multilingual performance.

Text GenerationCode Generationcommercial
More Details

Grok-1.5

xAI

An improved version of Grok with enhanced reasoning capabilities and a longer context.

Real-world UnderstandingLong-context Reasoningcommercial-preview
More Details

Jurassic-2 (J2)

AI21 Labs

A family of language models (Jumbo, Grande, Large) focusing on text quality and precise instruction following.

Instruction FollowingText Generationcommercial
More Details

Inflection-2.5

Inflection AI

The model that powers the Pi assistant, designed for empathetic and helpful personal conversations.

Conversational AIEmotional Intelligencecommercial
More Details

Amazon Titan Text Premier

Amazon

Amazon's latest and largest language model, designed for advanced enterprise applications on the Bedrock platform.

Enterprise SearchSummarizationcommercial
More Details

Llama-3.1-405B

Meta

The most powerful open-source model, offering transparency and extensive customization at a large scale.

Text GenerationComplex Reasoningllama3.1
More Details

Llama-3.1-70B

Meta

A strong and balanced open-source model, providing excellent performance for a wide range of tasks.

Text GenerationCode Generationllama3.1
More Details

Llama-3.1-8B

Meta

The smallest Llama 3.1 model, ideal for experiments and applications requiring high efficiency.

Text GenerationInstruction Followingllama3.1
More Details

Mixtral 8x22B

Mistral AI

A high-performance open Mixture of Experts (MoE) model, using 8 experts for high efficiency.

Text GenerationCode Generationapache-2.0
More Details

Mixtral 8x7B

Mistral AI

The original MoE model that revolutionized open performance, fast and efficient.

Text GenerationReasoningapache-2.0
More Details

Mistral 7B

Mistral AI

A small but very powerful model, considered an excellent starting point for many fine-tuned models.

Text GenerationFine-tuningapache-2.0
More Details

Gemma 2 27B

Google

The second generation of Gemma models, offering leading performance in its size class with high efficiency.

Text GenerationReasoninggemma-2
More Details

Gemma 2 9B

Google

A mid-size version of Gemma 2, ideal for on-device applications.

On-device AIText Generationgemma-2
More Details

Jais-30B-v3

Core42

The highest quality Arabic language model, open-source and bilingual (Arabic/English).

Text Generationapache-2.0
More Details

Aya 23

Cohere For AI

A multilingual model covering 23 languages, developed as part of a broad community effort.

Multilingual Text Generationapache-2.0
More Details

Nemotron-4 340B

NVIDIA

A massive open-source model designed to generate synthetic data for training other models.

Synthetic Data GenerationText Generationnvidia-open-model
More Details

Phi-3 Family (SLM)

Microsoft

A family of open-source small language models (SLM) (3.8B, 7B, 14B), very efficient for on-device applications.

On-device AIVisual Question Answeringmit
More Details

Qwen2

Alibaba Cloud

The second generation of Qwen models, with significant improvements in performance and multilingual capabilities.

Text GenerationChatapache-2.0
More Details

DBRX

Databricks

A powerful open MoE model, showing excellent performance in coding and general tasks.

Code GenerationText Generationdatabricks-open-model
More Details

Falcon 180B

TII

One of the largest open models, developed in the UAE and known for its strong performance.

Text GenerationReasoningtii-falcon-llm
More Details

OLMo

AI2

A family of fully open-source models from the Allen Institute for AI, including training data and tools.

AI ResearchText Generationapache-2.0
More Details

Yi-1.5

01.AI

The new generation of Yi models, available in various sizes and featuring strong conversation and vision capabilities.

Text GenerationChatyi-license
More Details

OpenELM

Apple

A family of open and efficient language models from Apple, designed to work efficiently on-device.

On-device AIText Generationother
More Details

Mamba

Carnegie Mellon & Princeton

A new model architecture (State Space Model) aiming to compete with the Transformer architecture with higher efficiency.

Long-context ReasoningSequence Modelingapache-2.0
More Details

Vicuna

LMSYS

An open-source conversational model fine-tuned from Llama, known for its high quality in dialogue.

ChatbotConversational AIapache-2.0
More Details

Zephyr

Hugging Face

A series of fine-tuned models that focus on better intent and instruction following.

Instruction FollowingChatmit
More Details

BLOOM

BigScience

A massive multilingual language model, developed in an open collaborative effort involving over 1000 researchers.

Multilingual Text Generationbigscience-rail-v1
More Details

MPT-30B

MosaicML

An open-source model from MosaicML (now part of Databricks) featuring a long context window and high quality.

Long-context QASummarizationapache-2.0
More Details

Orca 2

Microsoft

A model that focuses on improving the reasoning abilities of small models by simulating the thought processes of larger models.

ReasoningInstruction Followingmit
More Details

XLM-RoBERTa

Meta

A very powerful multilingual model, pre-trained on 2.5 terabytes of data in 100 languages.

Cross-lingual UnderstandingMultilingual Classificationmit
More Details

WizardLM

Microsoft

A family of models fine-tuned using complex, self-generated instruction data, enhancing their reasoning capabilities.

Complex Instruction FollowingReasoningmit
More Details

OpenChat

OpenChat

A set of open-source conversational models fine-tuned with diverse data to achieve strong conversational performance.

ChatbotConversational AIapache-2.0
More Details

Cerebras-GPT

Cerebras

A set of open-source language models released to demonstrate the capabilities of Cerebras hardware.

Text Generationapache-2.0
More Details

OpenHermes

Teknium

One of the most popular fine-tuned models from Mistral, known for its high quality in conversation and coding.

ChatCode Generationmit
More Details

Dolphin

Eric Hartford

A family of uncensored fine-tuned models (usually from Llama or Mistral), aiming to provide direct answers.

Uncensored ChatCreative Writingother
More Details

SeaLLM

AI Singapore

A family of language models designed specifically for Southeast Asian languages and cultures.

Southeast Asian NLPMultilingual Chatapache-2.0
More Details

BELLE

Lianjia

An open-source Chinese project focusing on producing high-quality instruction data and fine-tuned models.

Chinese Instruction FollowingChatother
More Details

ChatGLM3

Tsinghua University

The third generation of open bilingual (Chinese/English) conversational models.

Bilingual ChatText Generationapache-2.0
More Details

InternLM2

Shanghai AI Lab

The second generation of InternLM models, with improved performance across various tasks.

Text GenerationChatapache-2.0
More Details

LEIA

Large European AI consortium

A European consortium aiming to build large, open-source, multilingual language models specific to Europe.

European Multilingual NLPin-development
More Details

GPT-SW3

AI Sweden

A set of large language models trained specifically for Nordic languages.

Nordic Language NLPapache-2.0
More Details

Nous-Hermes 2

Nous Research

One of the strongest fine-tuned models from Mistral and Llama, trained on high-quality synthetic data.

ChatReasoningmit
More Details

SOLAR-10.7B

Upstage

An open-source model that combines different models to achieve high performance while maintaining a relatively small size.

Text GenerationInstruction Followingapache-2.0
More Details

Stable Diffusion 3

Stability AI

A powerful open-source model for image generation, known for its flexibility and ability to understand complex text.

Text-to-ImageImage-to-Imageother
More Details

Stable Diffusion XL

Stability AI

An improved version of Stable Diffusion that provides higher image quality and better compositions.

High-resolution Image Generationother
More Details

Midjourney V6

Midjourney

A leading commercial model for generating artistic and realistic images, characterized by very high aesthetic quality.

Text-to-ImageArt Generationcommercial
More Details

DALL-E 3

OpenAI

The image generation model from OpenAI, integrated into ChatGPT and Bing, focusing on ease of use.

Text-to-Imagecommercial
More Details

Imagen 3

Google

Google's latest model for image generation, designed to create realistic and creative images with a deep understanding of details.

Text-to-ImagePhotorealismcommercial
More Details

Ideogram 1.0

Ideogram AI

A model specialized in generating images that contain readable and clear text.

Text-to-ImageTypography Generationcommercial
More Details

LLaVA-NeXT

various

A leading open-source vision-language model, combining large language models with image understanding capabilities for conversation about them.

Visual Instruction TuningVisual Chatapache-2.0
More Details

Idefics2

Hugging Face

An open-source vision-language model from Hugging Face, known for its efficiency in handling images and text.

Visual Question AnsweringImage Descriptionapache-2.0
More Details

Florence-2

Microsoft

A unified and lightweight vision model that can handle a variety of tasks.

Object DetectionImage Captioningmit
More Details

YOLOv10

Tsinghua University

Real-time object detection with high efficiency and excellent accuracy.

Real-time Object Detectionagpl-3.0
More Details

YOLOv8

Ultralytics

One of the most common and widely used versions of YOLO, known for its ease of use and strong performance.

Real-time Object DetectionImage Segmentationagpl-3.0
More Details

YOLO-NAS

Deci AI

A new YOLO architecture found using Neural Architecture Search (NAS), to achieve an optimal balance between accuracy and speed.

Real-time Object Detectionother
More Details

ControlNet

Stanford University

A method for controlling image generation models by adding conditions such as poses or image edges.

Conditional Image Generationapache-2.0
More Details

CLIP

OpenAI

A foundational model that connects text and images, allowing for tasks like text-based image search and classification.

Zero-shot ClassificationImage-Text Matchingmit
More Details

Segment Anything (SAM)

Meta

An image segmentation model that can identify and isolate any object in any image with a single click.

Image SegmentationObject Maskingapache-2.0
More Details

Vision Transformer (ViT)

Google

A foundational model that showed the Transformer architecture can excel at computer vision tasks without needing CNNs.

Image Classificationapache-2.0
More Details

DeepFloyd IF

Stability AI

An image generation model that relies on diffusion in pixel space, allowing for high quality and excellent text generation capability.

Text-to-ImageTypography Generationother
More Details

Kandinsky 3.0

Sber AI & others

The third generation of the image generation model, with improvements in quality and instruction-following ability.

Text-to-ImageImage Inpaintingother
More Details

StyleGAN3

NVIDIA

The third generation of StyleGAN, designed to create high-quality images with precise control over style.

High-fidelity Image Synthesisnvidia-source-code
More Details

ESRGAN

various

A popular model for image super-resolution, capable of adding realistic details.

Image Super-Resolutionapache-2.0
More Details

DETR (DEtection TRansformer)

Meta

A model that treats object detection as a direct set prediction problem.

Object DetectionPanoptic Segmentationapache-2.0
More Details

Swin Transformer

Microsoft Research

A hierarchical vision transformer architecture that achieves high efficiency by calculating attention within shifted windows.

Image ClassificationObject Detectionmit
More Details

DINOv2

Meta

A vision model that learns strong image representations through self-supervised learning, making it excellent for downstream tasks.

Self-supervised LearningFeature Extractionapache-2.0
More Details

OpenPose

Carnegie Mellon University

A popular and influential library for real-time multi-person body, face, hand, and foot pose estimation.

Pose EstimationKeypoint Detectionother
More Details

SwinIR

various

An image restoration model based on Swin Transformer that achieves excellent results in tasks like denoising and super-resolution.

Image Super-ResolutionImage Denoisingmit
More Details

Mask R-CNN

Meta

A highly influential and foundational architecture for instance segmentation, which predicts a mask for each detected object.

Instance SegmentationObject Detectionmit
More Details

U-Net

University of Freiburg

A convolutional neural network architecture originally developed for biomedical image segmentation, which has become a standard in the field.

Biomedical Image SegmentationSemantic Segmentationconceptual
More Details

CogVLM

Tsinghua University

A powerful open-source vision-language model, featuring excellent capabilities in complex visual conversation.

Visual ChatVisual Groundingapache-2.0
More Details

BLIP-2

Salesforce Research

An efficient architecture for training vision-language models, achieving excellent performance on tasks like captioning and visual question answering.

Image CaptioningVisual Question Answeringbsd-3-clause
More Details

FastSAM

CASIA

A fast alternative to the original SAM model, achieving similar results at a much higher speed, making it suitable for real-time applications.

Real-time Image Segmentationapache-2.0
More Details

Sora

OpenAI

An advanced text-to-video model, capable of creating realistic and complex scenes up to a minute long.

Text-to-Videocommercial
More Details

KLING

Kuaishou

A competing Chinese model for generating realistic, high-definition (1080p) videos up to two minutes long.

Text-to-VideoImage-to-Videocommercial
More Details

Lumiere

Google

A text-to-video model that uses a Space-Time U-Net architecture to generate realistic and coherent motion.

Text-to-VideoVideo Stylizationcommercial
More Details

Pika 1.0

Pika Labs

An easy-to-use platform for generating and editing video with AI.

Text-to-VideoImage-to-Videocommercial
More Details

Runway Gen-2

RunwayML

One of the leading models in the market for generating video from text or images.

Text-to-VideoImage-to-Videocommercial
More Details

Stable Video Diffusion

Stability AI

An open-source model for generating short video clips from images.

Image-to-Videoother
More Details

Make-A-Video

Meta

One of the early and influential models in the text-to-video space.

Text-to-Videoresearch-only
More Details

VEO

Google

Google's latest model for generating high-quality (1080p+) video with a deep understanding of language and cinematic concepts.

High-quality Text-to-Videocommercial-preview
More Details

Whisper-large-v3

OpenAI

An accurate speech-to-text model, supporting multiple languages with great proficiency and efficiency.

Automatic Speech Recognitionmit
More Details

Suno V3

Suno AI

An advanced model for generating full songs and music (including vocals) from a simple text description.

Text-to-MusicSong Generationcommercial
More Details

Udio

Udio AI

A powerful model for generating high-quality music and songs, considered a major competitor to Suno.

Text-to-MusicSong Generationcommercial
More Details

MusicGen

Meta

A model for generating high-quality music clips from a text description.

Text-to-MusicMelody Conditioningcc-by-nc-4.0
More Details

XTTS-v2

Coqui AI

A high-quality text-to-speech model with voice cloning across multiple languages.

Text-to-SpeechVoice Cloningapache-2.0
More Details

ElevenLabs TTS

ElevenLabs

A leading platform for realistic speech generation and voice cloning, known for its extremely natural sound quality.

Text-to-SpeechVoice Cloningcommercial
More Details

Bark

Suno AI

An open-source text-to-speech model, capable of generating very realistic voices with non-verbal cues.

Expressive Text-to-Speechmit
More Details

VALL-E X

Microsoft

A text-to-speech model capable of cloning a person's voice from a short sample (3 seconds).

Zero-shot TTSVoice Cloningresearch-only
More Details

SeamlessM4T v2

Meta

A comprehensive translation model supporting speech-to-speech, speech-to-text, and text-to-speech across 100 languages.

Speech-to-Speech TranslationASRcc-by-nc-4.0
More Details

Wav2Vec 2.0

Meta

A foundational model for learning speech representations from unlabeled data, widely used in speech recognition.

Self-supervised Speech RepresentationASRmit
More Details

Tacotron 2

Google

A neural network architecture for generating speech directly from text, which is the basis for many modern TTS systems.

Text-to-Speech Synthesisapache-2.0
More Details

Riffusion

Riffusion

A model that uses Stable Diffusion to generate spectrograms and then converts them into music.

Text-to-MusicImage-to-Musicmit
More Details

Jukebox

OpenAI

An early and powerful model for generating music with rudimentary vocals, known for its high quality but very slow speed.

Music GenerationSinging Voice Synthesismit
More Details

MMS (Massively Multilingual Speech)

Meta

A massive project providing models for speech recognition, generation, and language identification for over 1100 languages.

Multilingual ASRMultilingual TTScc-by-nc-4.0
More Details

VITS

various

A popular speech generation architecture that combines VAEs and GANs to efficiently generate high-quality speech.

High-quality Text-to-Speechmit
More Details

CLAP

Microsoft

A model that connects text and audio, allowing for tasks like text-based audio search and classification.

Zero-shot Audio ClassificationAudio-Text Matchingmit
More Details

StyleTTS 2

various

A text-to-speech model that achieves very high quality with precise style control without needing a reference voice.

Expressive Text-to-SpeechStyle Diffusionmit
More Details

AlphaCode 2

Google DeepMind

An advanced code generation model that solves competitive programming problems at a level comparable to top programmers.

Competitive ProgrammingComplex Code Generationresearch-access
More Details

GitHub Copilot

GitHub (Microsoft)

The most popular AI-powered programming assistant, using OpenAI models to provide smart code suggestions.

Code CompletionCode Generationcommercial
More Details

Code Llama 70B

Meta

A model specialized in generating and understanding code, one of the most powerful open models in this field.

Code GenerationCode Completionllama2
More Details

DeepSeek-Coder-V2

DeepSeek AI

A very powerful open-source model specialized in programming, supporting 338 programming languages.

Code GenerationCode Completionmit
More Details

CodeGemma

Google

A family of open and lightweight models specialized in efficient code completion and generation.

Code GenerationFill-in-the-Middlegemma
More Details

StarCoder 2

BigCode (ServiceNow & Hugging Face)

The second generation of StarCoder models, trained on a massive amount of code.

Code GenerationCode Assistantbigcode-openrail-m
More Details

CodeGen2

Salesforce

An open-source model for code generation, capable of working with multiple programming languages.

Program SynthesisCode Generationapache-2.0
More Details

WizardCoder

WizardLM Team

A Llama model fine-tuned on a large programming dataset, achieving excellent performance.

Code GenerationInstruction Followingmit
More Details

Phind-Coder

Phind

An open-source programming model fine-tuned on high-quality data, known for its speed and accuracy.

Code GenerationDeveloper Q&Aother
More Details

AlphaFold 3

Google DeepMind

A revolutionary model that predicts the structure of proteins and molecular interactions with ultra-high accuracy.

Protein FoldingMolecular Interactioncommercial
More Details

ESMFold

Meta

A faster alternative to AlphaFold for protein structure prediction, based on language models.

Protein Structure Predictionmit
More Details

GraphCast

Google DeepMind

An AI model for weather forecasting that surpasses traditional methods in accuracy and speed.

Weather Forecastingapache-2.0
More Details

Med-PaLM 2

Google

A language model specialized in the medical field, for answering medical questions and summarizing health data.

Medical Q&AHealth Data Analysisresearch-access
More Details

BioGPT

Microsoft

A large language model pre-trained on biomedical literature for information extraction.

Biomedical Literature Miningmit
More Details

Galactica

Meta

A language model trained on scientific papers (was withdrawn but influential).

Scientific Literature ReviewFormula Generationmit (withdrawn)
More Details

RoseTTAFold

University of Washington

Another powerful model for protein structure prediction, considered a significant competitor to AlphaFold.

Protein Structure Predictionmit
More Details

FourCastNet

NVIDIA

A weather forecasting model based on computer vision, known for its extreme speed.

Weather Forecastingother
More Details

AlphaTensor

Google DeepMind

A reinforcement learning model that discovered new and faster algorithms for matrix multiplication, a fundamental computation.

Algorithm DiscoveryMatrix Multiplicationresearch-publication
More Details

GNoME

Google DeepMind

A model that discovered 2.2 million new crystal materials, significantly accelerating the process of materials discovery.

Materials DiscoveryCrystal Structure Predictionresearch-publication
More Details

ProGen

Salesforce Research

A language model for generating proteins with specific functions by learning from protein sequences.

Protein GenerationDrug Discoverymit
More Details

MolGPT

various

A modified language model for generating new molecules with desired chemical properties.

Molecule GenerationDrug Discoverymit
More Details

TripoSR

Stability AI & Tripo AI

A fast, open-source model for generating 3D models from a single image.

Image-to-3Dmit
More Details

Shap-E

OpenAI

A model for generating 3D models from text or images, producing implicit representations.

Text-to-3DImage-to-3Dmit
More Details

Point-E

OpenAI

A model for generating 3D models from text, focusing on creating a point cloud.

Text-to-3Dmit
More Details

DreamFusion

Google

A model that uses a 2D image generation model to generate coherent 3D scenes from text.

Text-to-3Dresearch-only
More Details

NeRF (Neural Radiance Fields)

UC Berkeley & Google

A foundational technique for generating novel views of complex 3D scenes from a small set of images.

Novel View Synthesis3D Scene Reconstructionmit
More Details

GET3D

NVIDIA

A model that generates high-quality 3D meshes with rich details and textures.

Text-to-3DHigh-quality 3D Synthesisnvidia-source-code
More Details

Gaussian Splatting

INRIA

A new technique for representing and rendering 3D scenes in real-time with high quality, considered a fast alternative to NeRF.

Real-time 3D RenderingNovel View Synthesismit
More Details

AlphaGo

Google DeepMind

The historic model that defeated the world champion in the game of Go, marking a milestone in AI.

Game Playing (Go)Reinforcement Learningresearch-publication
More Details

MuZero

Google DeepMind

An advanced model that learns the rules of the game itself and plans to achieve superhuman performance.

Self-taught Game PlayingModel-based RLresearch-publication
More Details

RT-2 (Robotics Transformer 2)

Google DeepMind

A vision-language-action model that transfers knowledge from the internet to robot control.

Vision-Language-Action ControlRoboticsresearch-publication
More Details

AlphaStar

Google DeepMind

A model that reached Grandmaster level in the strategically complex game of StarCraft II.

Real-time Strategy Game Playingresearch-publication
More Details

DreamerV3

Google DeepMind

A general reinforcement learning agent that can master a wide range of tasks with a fixed compute resource.

General Reinforcement Learningmit
More Details

Gato

Google DeepMind

A single general agent that can perform over 600 different tasks, including playing, chatting, and controlling a robotic arm.

Generalist AgentMultitask Learningresearch-publication
More Details

PPO (Proximal Policy Optimization)

OpenAI

A very popular reinforcement learning algorithm, used to train many agents due to its stability and ease of implementation.

Reinforcement Learningconceptual
More Details

PaLM-E

Google

A multimodal model that integrates vision and language into a single model for robot control.

Embodied ReasoningRoboticsresearch-publication
More Details

SayCan

Google

A model that connects language models with robotic capabilities, allowing robots to understand and execute high-level instructions.

RoboticsInstruction Followingresearch-publication
More Details

AlphaDev

Google DeepMind

A reinforcement learning model that discovered faster sorting algorithms, demonstrating its ability to improve fundamental code.

Algorithm DiscoveryCode Optimizationresearch-publication
More Details

DQN (Deep Q-Network)

Google DeepMind

The first deep reinforcement learning algorithm to achieve human-level performance on Atari games, a milestone in the field.

Reinforcement LearningGame Playing (Atari)conceptual
More Details

DeepAR

Amazon

A popular time series forecasting model that uses recurrent neural networks (RNN) to produce probabilistic forecasts.

Time Series Forecastingapache-2.0 (implementation)
More Details

N-BEATS

Element AI, MILA

A deep learning architecture for time series forecasting that achieves high performance without needing prior domain knowledge.

Time Series Forecastingmit
More Details

Temporal Fusion Transformer (TFT)

Google

A transformer-based model for multi-horizon time series forecasting, combining different types of inputs.

Interpretable Time Series Forecastingapache-2.0
More Details

LightGCN

National University of Singapore

A simplified and effective graph neural network model for recommendation, focusing on the neighborhood in the user-item graph.

Collaborative FilteringRecommendation Systemsmit
More Details

TimesNet

Tsinghua University

A new model for time series forecasting that discovers multiple periodic patterns in time data.

Time Series Forecastingmit
More Details

DCN V2 (Deep & Cross Network)

Google & Stanford

A popular recommendation model that combines deep networks and cross networks to effectively capture feature interactions.

Recommendation SystemsClick-Through Rate Predictionapache-2.0
More Details

TAPAS

Google AI

A BERT model modified to answer questions over tables, allowing for understanding of tabular data.

Table Question Answeringapache-2.0
More Details

ERNIE

Baidu

A series of language models from Baidu that focus on integrating knowledge from knowledge graphs.

Knowledge-enhanced NLPapache-2.0
More Details

PanGu-Σ

Huawei

A large language model from Huawei with strong performance in Chinese and general tasks.

Text GenerationChinese NLPother
More Details

HyperCLOVA X

Naver

A large language model from Naver (South Korea) designed for wide-scale applications.

Conversational AIEnterprise AIcommercial
More Details

GAN (Generative Adversarial Network)

University of Montreal

A foundational framework consisting of two networks (generator and discriminator) that compete to create realistic data, the basis for many generative models.

Generative ModelingImage Synthesisconceptual
More Details

VAE (Variational Autoencoder)

University of Amsterdam

A generative model that learns a latent representation of data, widely used in tasks like image generation and dimensionality reduction.

Generative ModelingDimensionality Reductionconceptual
More Details

TabNet

Google Research

A deep learning model for tabular data that uses sequential attention to select important features at each decision step.

Tabular Data ClassificationTabular Data Regressionapache-2.0
More Details

XGBoost

DMLC

An optimized, distributed, and high-performance gradient boosting library. Widely used in competitions and production.

ClassificationRegressionapache-2.0
More Details

LightGBM

Microsoft

A gradient boosting framework that uses histogram-based techniques for high speed and memory efficiency.

ClassificationRegressionmit
More Details

CatBoost

Yandex

A gradient boosting algorithm that handles categorical features automatically and efficiently.

ClassificationRegressionapache-2.0
More Details

Scikit-learn

Community

The essential Python library for machine learning, providing simple and effective tools for data mining and analysis.

ClassificationRegressionbsd-3-clause
More Details

Prophet

Meta

A library for forecasting time series data, designed to handle data with strong seasonal trends and holidays.

Time Series Forecastingmit
More Details

Isolation Forest

various

An effective algorithm for anomaly detection by isolating them instead of identifying normal regions.

Anomaly Detectionbsd-3-clause
More Details

SVM (Support Vector Machine)

various

A powerful classification algorithm that finds the best hyperplane that separates two classes of data.

ClassificationRegressionconceptual
More Details

BERT

Google

A foundational model that revolutionized natural language understanding through bidirectional attention.

Text ClassificationQuestion Answeringapache-2.0
More Details

T5 (Text-To-Text Transfer Transformer)

Google

A model that treats all NLP tasks as a "text-to-text" problem.

SummarizationTranslationapache-2.0
More Details

RoBERTa

Meta

A robust optimization of the BERT model, trained longer on more data.

Text ClassificationSentiment Analysismit
More Details

GPT-2

OpenAI

A model that showed surprising abilities in generating coherent text, its initial release was controversial.

Text Generationmit
More Details

Transformer

Google

The "Attention Is All You Need" paper that introduced the Transformer architecture, the basis for most modern models.

Machine TranslationSequence-to-sequencepaper
More Details

Word2Vec

Google

A foundational technique for creating "word embeddings," which capture semantic relationships between words.

Word EmbeddingsSemantic Similarityapache-2.0
More Details

AlexNet

University of Toronto

A convolutional neural network that won the 2012 ImageNet competition, sparking the modern deep learning revolution.

Image Classificationbsd-3-clause
More Details

ResNet

Microsoft Research

A deep neural network architecture that introduced "residual connections" to train much deeper networks.

Image ClassificationFeature Extractionmit
More Details

EfficientNet

Google

A family of models that uses a compound method to efficiently scale networks (depth, width, resolution).

Image Classificationapache-2.0
More Details

Stanford Alpaca

Stanford University

An influential project that showed a 7B Llama model could be fine-tuned on little data to achieve high performance.

Instruction Followingnon-commercial
More Details

Dolly 2.0

Databricks

The first open-source instruction-following model trained on a human-generated dataset licensed for commercial use.

Instruction FollowingChatmit
More Details

Guanaco

University of Washington

An improved version of Alpaca that uses the QLoRA technique to allow fine-tuning of large models on a single GPU.

Instruction Followingapache-2.0
More Details

Play.ht 2.0

Play.ht

An advanced commercial platform for generating very realistic voices and high-quality voice cloning.

Text-to-SpeechVoice Cloningcommercial
More Details

OPT (Open Pre-trained Transformer)

Meta

A series of open-source language models, released to help promote transparency and research in large models.

Text GenerationAI Researchother
More Details
⏱️