AI Dashboard 2025

An interactive analysis of the global and Arab AI landscape, from the AI Workshop

Model Database

Explore the most prominent models. Use the filters to narrow your search across more than 200 models.
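
The filtering described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the dashboard's actual implementation: the `ModelEntry` fields, the `filter_models` helper, and the `CATALOG` list are hypothetical names, and the three sample entries are copied from cards in this catalog.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class ModelEntry:
    """One catalog card. Field names are hypothetical."""
    name: str
    developer: str
    tasks: Tuple[str, ...]
    license_id: str


# Three sample entries taken from the cards in this catalog.
CATALOG = [
    ModelEntry("GPT-4o", "OpenAI",
               ("Real-time Conversation", "Text Generation"), "commercial"),
    ModelEntry("Mistral 7B", "Mistral AI",
               ("Text Generation", "Fine-tuning"), "apache-2.0"),
    ModelEntry("Whisper-large-v3", "OpenAI",
               ("Automatic Speech Recognition",), "mit"),
]


def filter_models(catalog: List[ModelEntry],
                  task: Optional[str] = None,
                  license_id: Optional[str] = None,
                  developer: Optional[str] = None) -> List[ModelEntry]:
    """Keep entries that satisfy every filter that was provided."""
    return [
        m for m in catalog
        if (task is None or task in m.tasks)
        and (license_id is None or m.license_id == license_id)
        and (developer is None or m.developer == developer)
    ]


open_text_models = filter_models(
    CATALOG, task="Text Generation", license_id="apache-2.0")
print([m.name for m in open_text_models])  # prints ['Mistral 7B']
```

Filters left as `None` are ignored, so calling `filter_models(CATALOG)` returns every entry, which mirrors a filter panel with nothing selected.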

GPT-4o

OpenAI

A leading native multimodal model, integrating real-time text, audio, and image understanding for natural and fast human-like interaction.

Real-time Conversation | Text Generation | commercial

GPT-4 Turbo

OpenAI

An enhanced version of GPT-4 with a massive context window (128k tokens) and knowledge updated to April 2023.

Text Generation | Code Generation | commercial

Claude 3.5 Sonnet

Anthropic

A leading model that balances intelligence and speed, excellent for coding, data analysis, and computer vision.

Text Generation | Code Generation | commercial

Claude 3 Opus

Anthropic

The most powerful model in the Claude 3 family, designed for highly complex tasks requiring near-human intelligence.

Complex Reasoning | Advanced Analysis | commercial

Claude 3 Haiku

Anthropic

The fastest and smallest model in the Claude 3 family, ideal for instant interactions and customer service.

Live Chat Support | Content Moderation | commercial

Gemini 1.5 Pro

Google

A multimodal model with a massive context window (1 million tokens), ideal for analyzing huge amounts of data.

Text Generation | Visual Question Answering | commercial

Gemini 1.5 Flash

Google

A lightweight and fast model from the Gemini 1.5 family, optimized for tasks requiring quick responses at scale.

Summarization | Chat Applications | commercial

Gemini 1.0 Ultra

Google

The largest model in the Gemini 1.0 family, designed for highly complex tasks and excelling at understanding nuances.

Advanced Reasoning | Multimodal Understanding | commercial

Command R+

Cohere

A model designed for enterprise applications, focusing on reliable RAG, tool use, and multilingual performance.

Text Generation | RAG | commercial / cc-by-nc-4.0

Command R

Cohere

The base model of the Command family, balancing performance and cost for enterprise applications.

RAG | Tool Use | commercial / cc-by-nc-4.0

Mistral Large

Mistral AI

A leading language model from Mistral AI, featuring strong reasoning capabilities and high multilingual performance.

Text Generation | Code Generation | commercial

Grok-1.5

xAI

An improved version of Grok with enhanced reasoning capabilities and a longer context.

Real-world Understanding | Long-context Reasoning | commercial-preview

Jurassic-2 (J2)

AI21 Labs

A family of language models (Jumbo, Grande, Large) focusing on text quality and precise instruction following.

Instruction Following | Text Generation | commercial

Inflection-2.5

Inflection AI

The model that powers the Pi assistant, designed for empathetic and helpful personal conversations.

Conversational AI | Emotional Intelligence | commercial

Amazon Titan Text Premier

Amazon

Amazon's latest and largest language model, designed for advanced enterprise applications on the Bedrock platform.

Enterprise Search | Summarization | commercial

Llama-3.1-405B

Meta

The largest model in the Llama 3.1 family, openly available and offering transparency and extensive customization at a large scale.

Text Generation | Complex Reasoning | llama3.1

Llama-3.1-70B

Meta

A strong and balanced open-source model, providing excellent performance for a wide range of tasks.

Text Generation | Code Generation | llama3.1

Llama-3.1-8B

Meta

The smallest Llama 3.1 model, ideal for experiments and applications requiring high efficiency.

Text Generation | Instruction Following | llama3.1

Mixtral 8x22B

Mistral AI

A high-performance open Mixture of Experts (MoE) model, using 8 experts for high efficiency.

Text Generation | Code Generation | apache-2.0

Mixtral 8x7B

Mistral AI

Mistral AI's first open MoE model, fast and efficient, which raised the bar for open-model performance.

Text Generation | Reasoning | apache-2.0

Mistral 7B

Mistral AI

A small but very powerful model, considered an excellent starting point for many fine-tuned models.

Text Generation | Fine-tuning | apache-2.0

Gemma 2 27B

Google

The second generation of Gemma models, offering leading performance in its size class with high efficiency.

Text Generation | Reasoning | gemma-2

Gemma 2 9B

Google

A mid-size version of Gemma 2, ideal for on-device applications.

On-device AI | Text Generation | gemma-2

Jais-30B-v3

Core42

The highest quality Arabic language model, open-source and bilingual (Arabic/English).

Text Generation | apache-2.0

Aya 23

Cohere For AI

A multilingual model covering 23 languages, developed as part of a broad community effort.

Multilingual Text Generation | apache-2.0

Nemotron-4 340B

NVIDIA

A massive open-source model designed to generate synthetic data for training other models.

Synthetic Data Generation | Text Generation | nvidia-open-model

Phi-3 Family (SLM)

Microsoft

A family of open-source small language models (SLMs) in 3.8B, 7B, and 14B sizes, very efficient for on-device applications.

On-device AI | Visual Question Answering | mit

Qwen2

Alibaba Cloud

The second generation of Qwen models, with significant improvements in performance and multilingual capabilities.

Text Generation | Chat | apache-2.0

DBRX

Databricks

A powerful open MoE model, showing excellent performance in coding and general tasks.

Code Generation | Text Generation | databricks-open-model

Falcon 180B

TII

One of the largest open models, developed in the UAE and known for its strong performance.

Text Generation | Reasoning | tii-falcon-llm

OLMo

AI2

A family of fully open-source models from the Allen Institute for AI, including training data and tools.

AI Research | Text Generation | apache-2.0

Yi-1.5

01.AI

The new generation of Yi models, available in various sizes and featuring strong conversation and vision capabilities.

Text Generation | Chat | yi-license

OpenELM

Apple

A family of open and efficient language models from Apple, designed to work efficiently on-device.

On-device AI | Text Generation | other

Mamba

Carnegie Mellon & Princeton

A new model architecture (State Space Model) aiming to compete with the Transformer architecture with higher efficiency.

Long-context Reasoning | Sequence Modeling | apache-2.0

Vicuna

LMSYS

An open-source conversational model fine-tuned from Llama, known for its high quality in dialogue.

Chatbot | Conversational AI | apache-2.0

Zephyr

Hugging Face

A series of fine-tuned models that focus on better alignment with user intent and on instruction following.

Instruction Following | Chat | mit

BLOOM

BigScience

A massive multilingual language model, developed in an open collaborative effort involving over 1000 researchers.

Multilingual Text Generation | bigscience-rail-v1

MPT-30B

MosaicML

An open-source model from MosaicML (now part of Databricks) featuring a long context window and high quality.

Long-context QA | Summarization | apache-2.0

Orca 2

Microsoft

A model that focuses on improving the reasoning abilities of small models by simulating the thought processes of larger models.

Reasoning | Instruction Following | mit

XLM-RoBERTa

Meta

A very powerful multilingual model, pre-trained on 2.5 terabytes of data in 100 languages.

Cross-lingual Understanding | Multilingual Classification | mit

WizardLM

Microsoft

A family of models fine-tuned using complex, self-generated instruction data, enhancing their reasoning capabilities.

Complex Instruction Following | Reasoning | mit

OpenChat

A set of open-source conversational models fine-tuned with diverse data to achieve strong conversational performance.

Chatbot | Conversational AI | apache-2.0

Cerebras-GPT

Cerebras

A set of open-source language models released to demonstrate the capabilities of Cerebras hardware.

Text Generation | apache-2.0

OpenHermes

Teknium

One of the most popular models fine-tuned from Mistral, known for its high quality in conversation and coding.

Chat | Code Generation | mit

Dolphin

Eric Hartford

A family of uncensored fine-tuned models (usually from Llama or Mistral), aiming to provide direct answers.

Uncensored Chat | Creative Writing | other

SeaLLM

Alibaba DAMO Academy

A family of language models designed specifically for Southeast Asian languages and cultures.

Southeast Asian NLP | Multilingual Chat | apache-2.0

BELLE

Lianjia

An open-source Chinese project focusing on producing high-quality instruction data and fine-tuned models.

Chinese Instruction Following | Chat | other

ChatGLM3

Tsinghua University

The third generation of open bilingual (Chinese/English) conversational models.

Bilingual Chat | Text Generation | apache-2.0

InternLM2

Shanghai AI Lab

The second generation of InternLM models, with improved performance across various tasks.

Text Generation | Chat | apache-2.0

LEIA

Large European AI consortium

A European consortium aiming to build large, open-source, multilingual language models specific to Europe.

European Multilingual NLP | in-development

GPT-SW3

AI Sweden

A set of large language models trained specifically for Nordic languages.

Nordic Language NLP | apache-2.0

Nous-Hermes 2

Nous Research

One of the strongest model series fine-tuned from Mistral and Llama, trained on high-quality synthetic data.

Chat | Reasoning | mit

SOLAR-10.7B

Upstage

An open-source model that combines different models to achieve high performance while maintaining a relatively small size.

Text Generation | Instruction Following | apache-2.0

Stable Diffusion 3

Stability AI

A powerful open-source model for image generation, known for its flexibility and ability to understand complex text.

Text-to-Image | Image-to-Image | other

Stable Diffusion XL

Stability AI

An improved version of Stable Diffusion that provides higher image quality and better compositions.

High-resolution Image Generation | other

Midjourney V6

Midjourney

A leading commercial model for generating artistic and realistic images, characterized by very high aesthetic quality.

Text-to-Image | Art Generation | commercial

DALL-E 3

OpenAI

The image generation model from OpenAI, integrated into ChatGPT and Bing, focusing on ease of use.

Text-to-Image | commercial

Imagen 3

Google

Google's latest model for image generation, designed to create realistic and creative images with a deep understanding of details.

Text-to-Image | Photorealism | commercial

Ideogram 1.0

Ideogram AI

A model specialized in generating images that contain readable and clear text.

Text-to-Image | Typography Generation | commercial

LLaVA-NeXT

various

A leading open-source vision-language model that combines a large language model with image understanding to support conversation about images.

Visual Instruction Tuning | Visual Chat | apache-2.0

Idefics2

Hugging Face

An open-source vision-language model from Hugging Face, known for its efficiency in handling images and text.

Visual Question Answering | Image Description | apache-2.0

Florence-2

Microsoft

A unified and lightweight vision model that can handle a variety of tasks.

Object Detection | Image Captioning | mit

YOLOv10

Tsinghua University

Real-time object detection with high efficiency and excellent accuracy.

Real-time Object Detection | agpl-3.0

YOLOv8

Ultralytics

One of the most common and widely used versions of YOLO, known for its ease of use and strong performance.

Real-time Object Detection | Image Segmentation | agpl-3.0

YOLO-NAS

Deci AI

A new YOLO architecture found using Neural Architecture Search (NAS), to achieve an optimal balance between accuracy and speed.

Real-time Object Detection | other

ControlNet

Stanford University

A method for controlling image generation models by adding conditions such as poses or image edges.

Conditional Image Generation | apache-2.0

CLIP

OpenAI

A foundational model that connects text and images, allowing for tasks like text-based image search and classification.

Zero-shot Classification | Image-Text Matching | mit

Segment Anything (SAM)

Meta

An image segmentation model that can identify and isolate any object in any image with a single click.

Image Segmentation | Object Masking | apache-2.0

Vision Transformer (ViT)

Google

A foundational model that showed the Transformer architecture can excel at computer vision tasks without needing CNNs.

Image Classification | apache-2.0

DeepFloyd IF

Stability AI

An image generation model that relies on diffusion in pixel space, allowing for high quality and excellent text generation capability.

Text-to-Image | Typography Generation | other

Kandinsky 3.0

Sber AI & others

The third generation of the image generation model, with improvements in quality and instruction-following ability.

Text-to-Image | Image Inpainting | other

StyleGAN3

NVIDIA

The third generation of StyleGAN, designed to create high-quality images with precise control over style.

High-fidelity Image Synthesis | nvidia-source-code

ESRGAN

various

A popular model for image super-resolution, capable of adding realistic details.

Image Super-Resolution | apache-2.0

DETR (DEtection TRansformer)

Meta

A model that treats object detection as a direct set prediction problem.

Object Detection | Panoptic Segmentation | apache-2.0

Swin Transformer

Microsoft Research

A hierarchical vision transformer architecture that achieves high efficiency by calculating attention within shifted windows.

Image Classification | Object Detection | mit

DINOv2

Meta

A vision model that learns strong image representations through self-supervised learning, making it excellent for downstream tasks.

Self-supervised Learning | Feature Extraction | apache-2.0

OpenPose

Carnegie Mellon University

A popular and influential library for real-time multi-person body, face, hand, and foot pose estimation.

Pose Estimation | Keypoint Detection | other

SwinIR

various

An image restoration model based on Swin Transformer that achieves excellent results in tasks like denoising and super-resolution.

Image Super-Resolution | Image Denoising | mit

Mask R-CNN

Meta

A highly influential and foundational architecture for instance segmentation, which predicts a mask for each detected object.

Instance Segmentation | Object Detection | mit

U-Net

University of Freiburg

A convolutional neural network architecture originally developed for biomedical image segmentation, which has become a standard in the field.

Biomedical Image Segmentation | Semantic Segmentation | conceptual

CogVLM

Tsinghua University

A powerful open-source vision-language model, featuring excellent capabilities in complex visual conversation.

Visual Chat | Visual Grounding | apache-2.0

BLIP-2

Salesforce Research

An efficient architecture for training vision-language models, achieving excellent performance on tasks like captioning and visual question answering.

Image Captioning | Visual Question Answering | bsd-3-clause

FastSAM

CASIA

A fast alternative to the original SAM model, achieving similar results at a much higher speed, making it suitable for real-time applications.

Real-time Image Segmentation | apache-2.0

Sora

OpenAI

An advanced text-to-video model, capable of creating realistic and complex scenes up to a minute long.

Text-to-Video | commercial

KLING

Kuaishou

A competing Chinese model for generating realistic, high-definition (1080p) videos up to two minutes long.

Text-to-Video | Image-to-Video | commercial

Lumiere

Google

A text-to-video model that uses a Space-Time U-Net architecture to generate realistic and coherent motion.

Text-to-Video | Video Stylization | commercial

Pika 1.0

Pika Labs

An easy-to-use platform for generating and editing video with AI.

Text-to-Video | Image-to-Video | commercial

Runway Gen-2

RunwayML

One of the leading models in the market for generating video from text or images.

Text-to-Video | Image-to-Video | commercial

Stable Video Diffusion

Stability AI

An open-source model for generating short video clips from images.

Image-to-Video | other

Make-A-Video

Meta

One of the early and influential models in the text-to-video space.

Text-to-Video | research-only

VEO

Google

Google's latest model for generating high-quality (1080p+) video with a deep understanding of language and cinematic concepts.

High-quality Text-to-Video | commercial-preview

Whisper-large-v3

OpenAI

An accurate speech-to-text model, supporting multiple languages with great proficiency and efficiency.

Automatic Speech Recognition | mit

Suno V3

Suno AI

An advanced model for generating full songs and music (including vocals) from a simple text description.

Text-to-Music | Song Generation | commercial

Udio

Udio AI

A powerful model for generating high-quality music and songs, considered a major competitor to Suno.

Text-to-Music | Song Generation | commercial

MusicGen

Meta

A model for generating high-quality music clips from a text description.

Text-to-Music | Melody Conditioning | cc-by-nc-4.0

XTTS-v2

Coqui AI

A high-quality text-to-speech model with voice cloning across multiple languages.

Text-to-Speech | Voice Cloning | apache-2.0

ElevenLabs TTS

ElevenLabs

A leading platform for realistic speech generation and voice cloning, known for its extremely natural sound quality.

Text-to-Speech | Voice Cloning | commercial

Bark

Suno AI

An open-source text-to-speech model, capable of generating very realistic voices with non-verbal cues.

Expressive Text-to-Speech | mit

VALL-E X

Microsoft

A text-to-speech model capable of cloning a person's voice from a short sample (3 seconds).

Zero-shot TTS | Voice Cloning | research-only

SeamlessM4T v2

Meta

A comprehensive translation model supporting speech-to-speech, speech-to-text, and text-to-speech across 100 languages.

Speech-to-Speech Translation | ASR | cc-by-nc-4.0

Wav2Vec 2.0

Meta

A foundational model for learning speech representations from unlabeled data, widely used in speech recognition.

Self-supervised Speech Representation | ASR | mit

Tacotron 2

Google

A neural network architecture for generating speech directly from text, which is the basis for many modern TTS systems.

Text-to-Speech Synthesis | apache-2.0

Riffusion

A model that uses Stable Diffusion to generate spectrograms and then converts them into music.

Text-to-Music | Image-to-Music | mit

Jukebox

OpenAI

An early and powerful model for generating music with rudimentary vocals, known for its high quality but very slow speed.

Music Generation | Singing Voice Synthesis | mit

MMS (Massively Multilingual Speech)

Meta

A massive project providing models for speech recognition, generation, and language identification for over 1100 languages.

Multilingual ASR | Multilingual TTS | cc-by-nc-4.0

VITS

various

A popular speech generation architecture that combines VAEs and GANs to efficiently generate high-quality speech.

High-quality Text-to-Speech | mit

CLAP

Microsoft

A model that connects text and audio, allowing for tasks like text-based audio search and classification.

Zero-shot Audio Classification | Audio-Text Matching | mit

StyleTTS 2

various

A text-to-speech model that achieves very high quality with precise style control without needing a reference voice.

Expressive Text-to-Speech | Style Diffusion | mit

AlphaCode 2

Google DeepMind

An advanced code generation model that solves competitive programming problems at a level comparable to top programmers.

Competitive Programming | Complex Code Generation | research-access

GitHub Copilot

GitHub (Microsoft)

The most popular AI-powered programming assistant, using OpenAI models to provide smart code suggestions.

Code Completion | Code Generation | commercial

Code Llama 70B

Meta

A model specialized in generating and understanding code, one of the most powerful open models in this field.

Code Generation | Code Completion | llama2

DeepSeek-Coder-V2

DeepSeek AI

A very powerful open-source model specialized in programming, supporting 338 programming languages.

Code Generation | Code Completion | mit

CodeGemma

Google

A family of open and lightweight models specialized in efficient code completion and generation.

Code Generation | Fill-in-the-Middle | gemma

StarCoder 2

BigCode (ServiceNow & Hugging Face)

The second generation of StarCoder models, trained on a massive amount of code.

Code Generation | Code Assistant | bigcode-openrail-m

CodeGen2

Salesforce

An open-source model for code generation, capable of working with multiple programming languages.

Program Synthesis | Code Generation | apache-2.0

WizardCoder

WizardLM Team

A code model fine-tuned on a large set of programming instructions (first built on StarCoder, with later versions built on Code Llama), achieving excellent performance.

Code Generation | Instruction Following | mit

Phind-Coder

Phind

An open-source programming model fine-tuned on high-quality data, known for its speed and accuracy.

Code Generation | Developer Q&A | other

AlphaFold 3

Google DeepMind

A revolutionary model that predicts the structure of proteins and molecular interactions with ultra-high accuracy.

Protein Folding | Molecular Interaction | commercial

ESMFold

Meta

A faster alternative to AlphaFold for protein structure prediction, based on language models.

Protein Structure Prediction | mit

GraphCast

Google DeepMind

An AI model for weather forecasting that surpasses traditional methods in accuracy and speed.

Weather Forecasting | apache-2.0

Med-PaLM 2

Google

A language model specialized in the medical field, for answering medical questions and summarizing health data.

Medical Q&A | Health Data Analysis | research-access

BioGPT

Microsoft

A large language model pre-trained on biomedical literature for information extraction.

Biomedical Literature Mining | mit

Galactica

Meta

A language model trained on scientific papers. Its public demo was withdrawn shortly after release, but the model remains influential.

Scientific Literature Review | Formula Generation | mit (withdrawn)

RoseTTAFold

University of Washington

Another powerful model for protein structure prediction, considered a significant competitor to AlphaFold.

Protein Structure Prediction | mit

FourCastNet

NVIDIA

A weather forecasting model based on computer vision, known for its extreme speed.

Weather Forecasting | other

AlphaTensor

Google DeepMind

A reinforcement learning model that discovered new and faster algorithms for matrix multiplication, a fundamental computation.

Algorithm Discovery | Matrix Multiplication | research-publication

GNoME

Google DeepMind

A model that discovered 2.2 million new crystal materials, significantly accelerating the process of materials discovery.

Materials Discovery | Crystal Structure Prediction | research-publication

ProGen

Salesforce Research

A language model for generating proteins with specific functions by learning from protein sequences.

Protein Generation | Drug Discovery | mit

MolGPT

various

A modified language model for generating new molecules with desired chemical properties.

Molecule Generation | Drug Discovery | mit

TripoSR

Stability AI & Tripo AI

A fast, open-source model for generating 3D models from a single image.

Image-to-3D | mit

Shap-E

OpenAI

A model for generating 3D models from text or images, producing implicit representations.

Text-to-3D | Image-to-3D | mit

Point-E

OpenAI

A model for generating 3D models from text, focusing on creating a point cloud.

Text-to-3D | mit

DreamFusion

Google

A model that uses a 2D image generation model to generate coherent 3D scenes from text.

Text-to-3D | research-only

NeRF (Neural Radiance Fields)

UC Berkeley & Google

A foundational technique for generating novel views of complex 3D scenes from a small set of images.

Novel View Synthesis | 3D Scene Reconstruction | mit

GET3D

NVIDIA

A model that generates high-quality 3D meshes with rich details and textures.

Text-to-3D | High-quality 3D Synthesis | nvidia-source-code

Gaussian Splatting

INRIA

A new technique for representing and rendering 3D scenes in real-time with high quality, considered a fast alternative to NeRF.

Real-time 3D Rendering | Novel View Synthesis | mit

AlphaGo

Google DeepMind

The historic model that defeated the world champion in the game of Go, marking a milestone in AI.

Game Playing (Go) | Reinforcement Learning | research-publication

MuZero

Google DeepMind

An advanced model that is not given the rules of the game; it learns its own model of the game's dynamics and plans with it to achieve superhuman performance.

Self-taught Game Playing | Model-based RL | research-publication

RT-2 (Robotics Transformer 2)

Google DeepMind

A vision-language-action model that transfers knowledge from the internet to robot control.

Vision-Language-Action Control | Robotics | research-publication

AlphaStar

Google DeepMind

A model that reached Grandmaster level in the strategically complex game of StarCraft II.

Real-time Strategy Game Playing | research-publication

DreamerV3

Google DeepMind

A general reinforcement learning agent that can master a wide range of tasks with a single fixed set of hyperparameters.

General Reinforcement Learning | mit

Gato

Google DeepMind

A single general agent that can perform over 600 different tasks, including playing, chatting, and controlling a robotic arm.

Generalist Agent | Multitask Learning | research-publication

PPO (Proximal Policy Optimization)

OpenAI

A very popular reinforcement learning algorithm, used to train many agents due to its stability and ease of implementation.

Reinforcement Learning | conceptual

PaLM-E

Google

A multimodal model that integrates vision and language into a single model for robot control.

Embodied Reasoning | Robotics | research-publication

SayCan

Google

A model that connects language models with robotic capabilities, allowing robots to understand and execute high-level instructions.

Robotics | Instruction Following | research-publication

AlphaDev

Google DeepMind

A reinforcement learning model that discovered faster sorting algorithms, demonstrating its ability to improve fundamental code.

Algorithm Discovery | Code Optimization | research-publication

DQN (Deep Q-Network)

Google DeepMind

The first deep reinforcement learning algorithm to achieve human-level performance on Atari games, a milestone in the field.

Reinforcement Learning | Game Playing (Atari) | conceptual

DeepAR

Amazon

A popular time series forecasting model that uses recurrent neural networks (RNN) to produce probabilistic forecasts.

Time Series Forecasting | apache-2.0 (implementation)

N-BEATS

Element AI, MILA

A deep learning architecture for time series forecasting that achieves high performance without needing prior domain knowledge.

Time Series Forecasting | mit

Temporal Fusion Transformer (TFT)

Google

A transformer-based model for multi-horizon time series forecasting, combining different types of inputs.

Interpretable Time Series Forecasting | apache-2.0

LightGCN

National University of Singapore

A simplified and effective graph neural network model for recommendation, focusing on the neighborhood in the user-item graph.

Collaborative Filtering | Recommendation Systems | mit

TimesNet

Tsinghua University

A new model for time series forecasting that discovers multiple periodic patterns in time data.

Time Series Forecasting | mit

DCN V2 (Deep & Cross Network)

Google & Stanford

A popular recommendation model that combines deep networks and cross networks to effectively capture feature interactions.

Recommendation Systems | Click-Through Rate Prediction | apache-2.0

TAPAS

Google AI

A BERT model modified to answer questions over tables, allowing for understanding of tabular data.

Table Question Answering | apache-2.0

ERNIE

Baidu

A series of language models from Baidu that focus on integrating knowledge from knowledge graphs.

Knowledge-enhanced NLP | apache-2.0

PanGu-Σ

Huawei

A large language model from Huawei with strong performance in Chinese and general tasks.

Text Generation | Chinese NLP | other

HyperCLOVA X

Naver

A large language model from Naver (South Korea) designed for wide-scale applications.

Conversational AI | Enterprise AI | commercial

GAN (Generative Adversarial Network)

University of Montreal

A foundational framework consisting of two networks (generator and discriminator) that compete to create realistic data, the basis for many generative models.

Generative Modeling | Image Synthesis | conceptual

VAE (Variational Autoencoder)

University of Amsterdam

A generative model that learns a latent representation of data, widely used in tasks like image generation and dimensionality reduction.

Generative Modeling | Dimensionality Reduction | conceptual

TabNet

Google Research

A deep learning model for tabular data that uses sequential attention to select important features at each decision step.

Tabular Data Classification | Tabular Data Regression | apache-2.0

XGBoost

DMLC

An optimized, distributed, high-performance gradient boosting library, widely used in competitions and production.

Classification | Regression | apache-2.0

LightGBM

Microsoft

A gradient boosting framework that uses histogram-based techniques for high speed and memory efficiency.

Classification | Regression | mit

CatBoost

Yandex

A gradient boosting algorithm that handles categorical features automatically and efficiently.

Classification | Regression | apache-2.0

Scikit-learn

Community

The essential Python library for machine learning, providing simple and effective tools for data mining and analysis.

Classification | Regression | bsd-3-clause

Prophet

Meta

A library for forecasting time series data, designed to handle data with strong seasonal trends and holidays.

Time Series Forecasting | mit

Isolation Forest

various

An effective anomaly detection algorithm that isolates anomalies directly instead of modeling normal regions.

Anomaly Detection | bsd-3-clause

SVM (Support Vector Machine)

various

A powerful classification algorithm that finds the maximum-margin hyperplane separating two classes of data.

Classification | Regression | conceptual

BERT

Google

A foundational model that revolutionized natural language understanding through bidirectional attention.

Text Classification | Question Answering | apache-2.0

T5 (Text-To-Text Transfer Transformer)

Google

A model that treats all NLP tasks as a "text-to-text" problem.

Summarization | Translation | apache-2.0

RoBERTa

Meta

A robust optimization of the BERT model, trained longer on more data.

Text Classification | Sentiment Analysis | mit

GPT-2

OpenAI

A model that showed surprising abilities in generating coherent text and whose initial release was controversial.

Text Generation | mit

Transformer

Google

The architecture introduced in the "Attention Is All You Need" paper, the basis for most modern models.

Machine Translation | Sequence-to-sequence | paper

Word2Vec

Google

A foundational technique for creating "word embeddings," which capture semantic relationships between words.

Word Embeddings | Semantic Similarity | apache-2.0

AlexNet

University of Toronto

A convolutional neural network that won the 2012 ImageNet competition, sparking the modern deep learning revolution.

Image Classification | bsd-3-clause

ResNet

Microsoft Research

A deep neural network architecture that introduced "residual connections" to train much deeper networks.

Image Classification | Feature Extraction | mit

EfficientNet

Google

A family of models that uses a compound method to efficiently scale networks (depth, width, resolution).

Image Classification | apache-2.0

Stanford Alpaca

Stanford University

An influential project that showed a 7B LLaMA model could be fine-tuned on a small instruction dataset (about 52K examples) to follow instructions well.

Instruction Following | non-commercial

Dolly 2.0

Databricks

The first open-source instruction-following model trained on a human-generated dataset licensed for commercial use.

Instruction Following | Chat | mit

Guanaco

University of Washington

A family of LLaMA-based instruction-following models fine-tuned with the QLoRA technique, which allows fine-tuning of large models on a single GPU.

Instruction Following | apache-2.0

Play.ht 2.0

Play.ht

An advanced commercial platform for generating very realistic voices and high-quality voice cloning.

Text-to-Speech | Voice Cloning | commercial

OPT (Open Pre-trained Transformer)

Meta

A series of open-source language models, released to help promote transparency and research in large models.

Text Generation | AI Research | other