February
+ +Midjourney v1
+ +diff --git a/Gemfile b/Gemfile new file mode 100644 index 0000000..d31a08b --- /dev/null +++ b/Gemfile @@ -0,0 +1,2 @@ +source "https://rubygems.org" +gem "jekyll" \ No newline at end of file diff --git a/Gemfile.lock b/Gemfile.lock new file mode 100644 index 0000000..254b08d --- /dev/null +++ b/Gemfile.lock @@ -0,0 +1,74 @@ +GEM + remote: https://rubygems.org/ + specs: + addressable (2.8.7) + public_suffix (>= 2.0.2, < 7.0) + bigdecimal (3.1.8) + colorator (1.1.0) + concurrent-ruby (1.3.4) + em-websocket (0.5.3) + eventmachine (>= 0.12.9) + http_parser.rb (~> 0) + eventmachine (1.2.7) + ffi (1.17.0-x64-mingw-ucrt) + forwardable-extended (2.6.0) + google-protobuf (4.29.0-x64-mingw-ucrt) + bigdecimal + rake (>= 13) + http_parser.rb (0.8.0) + i18n (1.14.6) + concurrent-ruby (~> 1.0) + jekyll (4.3.4) + addressable (~> 2.4) + colorator (~> 1.0) + em-websocket (~> 0.5) + i18n (~> 1.0) + jekyll-sass-converter (>= 2.0, < 4.0) + jekyll-watch (~> 2.0) + kramdown (~> 2.3, >= 2.3.1) + kramdown-parser-gfm (~> 1.0) + liquid (~> 4.0) + mercenary (>= 0.3.6, < 0.5) + pathutil (~> 0.9) + rouge (>= 3.0, < 5.0) + safe_yaml (~> 1.0) + terminal-table (>= 1.8, < 4.0) + webrick (~> 1.7) + jekyll-sass-converter (3.0.0) + sass-embedded (~> 1.54) + jekyll-watch (2.2.1) + listen (~> 3.0) + kramdown (2.5.1) + rexml (>= 3.3.9) + kramdown-parser-gfm (1.1.0) + kramdown (~> 2.0) + liquid (4.0.4) + listen (3.9.0) + rb-fsevent (~> 0.10, >= 0.10.3) + rb-inotify (~> 0.9, >= 0.9.10) + mercenary (0.4.0) + pathutil (0.16.2) + forwardable-extended (~> 2.6) + public_suffix (6.0.1) + rake (13.2.1) + rb-fsevent (0.11.2) + rb-inotify (0.11.1) + ffi (~> 1.0) + rexml (3.3.9) + rouge (4.5.1) + safe_yaml (1.0.5) + sass-embedded (1.81.0-x64-mingw-ucrt) + google-protobuf (~> 4.28) + terminal-table (3.0.2) + unicode-display_width (>= 1.1.1, < 3) + unicode-display_width (2.6.0) + webrick (1.9.0) + +PLATFORMS + x64-mingw-ucrt + +DEPENDENCIES + jekyll + +BUNDLED WITH + 2.5.23 diff --git a/_data/links.yml b/_data/links.yml new file mode 100644 index 0000000..8a718cc --- /dev/null +++ b/_data/links.yml @@ -0,0 +1,19 @@ +- text: The Latest AI Innovations + url: https://www.futurepedia.io/ai-innovations +- text: Artificial Intelligence Index + url: https://aiindex.stanford.edu +- text: 'Generative AI Timeline: 9 Decades of Notable Milestones' + url: https://www.cmswire.com/digital-experience/generative-ai-timeline-9-decades-of-notable-milestones/ +- text: The History of Artificial Intelligence - Harvard + url: https://sitn.hms.harvard.edu/flash/2017/history-artificial-intelligence/ +- text: Analyzing 2023's Milestones and Forecasting 2024's Trends + url: https://masterofcode.com/blog/ai-highlights-2024 +- text: Timeline of Artificial Intelligence - Wikipedia + url: https://en.wikipedia.org/wiki/Timeline_of_artificial_intelligence +- text: The History of Artificial Intelligence - Complete AI Timeline + url: https://www.techtarget.com/searchenterpriseai/tip/The-history-of-artificial-intelligence-Complete-AI-timeline +- text: 'From the World Wide Web to AI: 11 Technology Milestones That Changed Our + Lives' + url: https://www.weforum.org/agenda/2024/03/11-technology-milestones-ai-quantum-computing-vr/ +- text: 'Artificial Intelligence (AI) and ChatGPT: History and Timelines' + url: https://www.officetimeline.com/blog/artificial-intelligence-ai-and-chatgpt-history-and-timelines diff --git a/_data/timeline.yml b/_data/timeline.yml new file mode 100644 index 0000000..6619ed2 --- /dev/null +++ b/_data/timeline.yml @@ -0,0 +1,302 @@ +- year: 2022 + events: + - date: February + info: + - text: Midjourney v1 + - date: April + info: + - text: Midjourney v2 + - text: DALL-E 2 is announced for gradual release. + special: true + - date: July + info: + - text: Midjourney v3 is launched. + - date: August + info: + - text: Stable Diffusion 1.4 is released. + - date: October + info: + - text: Stable Diffusion 1.5 becomes available. + - date: November + info: + - text: Midjourney v4 is released. + - text: Stable Diffusion 2.0 is launched. + - text: ChatGPT 3.5, a large language model by OpenAI, is released + to the public and quickly becomes a viral sensation. + special: true + - date: December + info: + - text: Stable Diffusion 2.1 is released. +- year: 2023 + events: + - date: February + info: + - text: Meta releases the LLaMA language model as open-source + for research purposes. The model is later leaked. + special: true + - text: Microsoft gradually releases Bing AI, an AI chat based + on an upgraded GPT model integrating internet search. + - date: March + info: + - text: Midjourney v5 is launched. + - text: OpenAI's GPT-4 model is partially released, featuring + multimodal image analysis and improved multi-language support. + - text: Google releases the AI chat Bard in a limited capacity, + based on the LaMDA language model. + - date: April + info: + - text: Adobe releases the Firefly image creation model as a + beta version to a waiting list. The model allowed a variety of capabilities + including text formatting. + - date: May + info: + - text: Midjourney v5.1 is released. + - text: Google announces an upgrade to Bard, moving it to the upgraded PaLM + 2 language model. It will support 180 countries and many languages. + - date: June + info: + - text: Midjourney v5.2 is launched. + - date: July + info: + - text: Stable Diffusion XL 1.0 is released. + - text: Anthropic announces a new version of their large language model - Claude + 2. + - text: Meta releases the LLaMA 2 open source language model + to the general public in a variety of sizes. + - date: October + info: + - text: DALL-E 3 is released. + - text: Adobe releases Firefly 2. + - date: November + info: + - text: Stable Diffusion XL Turbo is released - A fast model + that allows the creation of an image in one step in real time. + - date: December + info: + - text: Midjourney v6 is launched. + - text: Google upgrades Bard in limited areas, moving it to be based on the upgraded + Gemini Pro language model. + - text: X Corporation launches Grok AI chatbot for paid subscribers + in English language. +- year: 2024 + events: + - date: February + info: + - text: Stability AI announces Stable Diffusion 3 (gradually + released to waiting list). + - text: Google upgrades the artificial intelligence chat in Bard, basing it on + the new Gemini Pro model, in all available languages. Google + replaces "Bard" with "Gemini". + - text: Google announces the Gemini Pro 1.5 multimodal language + model capable of parsing up to a million tokens, as well as parsing video + and images. The model is gradually released to developers on a waiting list. + special: true + - text: OpenAI announces the Sora model that produces videos + up to a minute long. The model is not released to the public at this time. + special: true + - date: March + info: + - text: X Corporation announces the upcoming release of the Grok 1.5 + open source model. + - text: Anthropic announces Claude 3, a new version of their + large language model. The version is deployed in 3 different sizes, with the + largest model performing better than GPT-4. + - text: Suno AI, which develops a model for creating music, releases Suno + v3 to the general public. + - date: April + info: + - text: Stability AI releases a new update to the music creation model - Stable + Audio 2.0. + - text: X Corporation releases an upgrade to its language model, Grok-1.5V, + which integrates high-level image recognition. In the test presented by the + company, the model is the best in identifying and analyzing images compared + to other models. + - text: The Mistral company releases its new model Mixtral 8x22B + as open source. This is the most powerful model among the open source models + and it contains 141 billion parameters but uses a method that allows more + economical use. + - text: Meta releases the LLaMA 3 model as open source in sizes + 8B and 70B parameters. The large model shows better performance than Claude + 3 Sonnet and Gemini Pro 1.5 in several measures. Meta is expected to later + release larger models with 400 billion parameters and more. + - text: Microsoft releases the Phi-3-mini model in open source. + The model comes in a reduced version of 3.8B parameters, which allows it to + run on mobile devices as well, and it presents capabilities similar to GPT-3.5. + special: true + - text: Adobe announces its new image creation model Firefly 3. + - text: The startup Reka AI presents a series of multimodal language + models in 3 sizes. The models are capable of processing video, audio and images. + The large model featured similar capabilities to GPT-4. + - text: Apple releases as full open source a series of small language models under + the name OpenELM. The models are available in four weights + between 270 million and 3 billion parameters. + - date: May + info: + - text: OpenAI announces the GPT-4-O model that presents full + multimodal capabilities, including receiving and creating text, images, and + audio. The model presents an impressive ability to speak with a high response + speed and in natural language. The model is 2 times more efficient than the + GPT-4 Turbo model, and has better capabilities for languages other than English. + special: true + - text: 'Google announces a large number of AI features in its products. The main + ones: increasing the token limit to 2 million for Gemini 1.5 to waiting list, + releasing a smaller and faster Gemini Flash 1.5 model. Revealing + the latest image creation model Imagen 3, music creation + model Music AI and video creation model Veo. + And the announcement of the Astra model with multimodal capabilities + for realtime audio and video reception.' + - text: 'Microsoft announces Copilot+ for dedicated computers, + which will allow a full search of the user''s history through screenshots + of the user''s activity. The company also released as open source the SLMs + that display impressive capabilities in a minimal size: Phi-3 Small, + Phi-3 Medium, and Phi-3 Vision which includes + image recognition capability.' + - text: Meta introduces Chameleon, a new multimodal model that + seamlessly renders text and images. + - text: Mistral AI releases a new open source version of its language model Mistral-7B-Instruct-v0.3. + - text: Google announces AI Overviews intended to give a summary + of the relevant information in Google search. + special: true + - text: Suno AI releases an updated music creation model Suno v3.5. + - text: Mistral AI releases a new language model designed for coding Codestral + in size 22B. + - date: June + info: + - text: Stability AI releases its updated image creation model Stable + Diffusion 3 in a medium version in size 2B parameters. + - text: Apple announces Apple Intelligence, an AI system that + will be integrated into the company's devices and will combine AI models of + different sizes for different tasks. + - text: DeepSeekAI publishes the DeepSeekCoderV2 open source + language model which presents similar coding capabilities to models such as + GPT-4, Claude 3 Opus and more. + - text: Runway introduces Gen3 Alpha, a new + AI model for video generation. + - text: Anthropic releases the Claude Sonnet 3.5 model, which + presents better capabilities than other models with low resource usage. + special: true + - text: Microsoft releases in open source a series of image recognition models + called Florence 2. + - text: Google announces Gemma 2 open source language models + with 9B and 27B parameter sizes. Also, the company opens the context window + capabilities to developers for up to 2 million tokens. + - date: July + info: + - text: OpenAI has released a miniaturized model called gpt4o mini + that presents high capabilities at a low cost + - text: Meta releases as open source the llama 3.1 model in sizes + 8B 70B and 405B. The large model features the same capabilities as the best + closed source models + special: true + - text: 'mistral ai releases three new models: Codestral Mamba, + Mistral NeMo and Mathstral designed for + mathematics' + - text: Google DeepMind has unveiled two new AI systems that won silver medals + at this year's International Mathematical Olympiad (IMO), AlphaProof + and AlphaGeometry 2. + special: true + - text: OpenAI launched SearchGPT, an integrated web search + - text: Startup Udio has released Udio v1.5, an updated version + of its music creation model + - text: mistral ai has released a large language model Mistral Large + 2 in size 123B, which presents capabilities close to the closed SOTA + models. + special: true + - text: Midjourney v6.1 is released + - text: Google releases the Gemma 2 2B model as open source. + The model demonstrates better capabilities than much larger models. + - date: August + info: + - text: '"Black Forest Labs" releases weights for an image creation model named + Flux, which shows better performance than similar closedsource + models.' + - text: OpenAI released a new version of its model, gpt4o 0806, + achieving 100% success in generating valid JSON output. + - text: Google's image generation model, Imagen 3, has been released. + - text: xAI Corporation has launched the models Grok 2 and Grok + 2 mini, which demonstrate performance on par with leading SOTA models + in the market. + - text: Microsoft has introduced its small language models, Phi 3.5, + in three versions, each showcasing impressive performance relative to their + size. + - text: 'Google has introduced three new experimental AI models: Gemini + 1.5 Flash8B, Gemini 1.5 Pro Enhanced, and Gemini + 1.5 Flash Updated.' + - text: Ideogram 2.0 has been released, offering image generation + capabilities that surpass those of other leading models. + - text: Luma has unveiled the Dream Machine 1.5 model for video + creation. + - date: September + info: + - text: The French AI company Mistral has introduced Pixtral12B, + its first multimodal model capable of processing both images and text. + - text: 'OPENAI has released two nextgeneration AI models to its subscribers: + o1 preview and o1 mini. These models show + a significant improvement in performance, particularly in tasks requiring + reasoning, including coding, mathematics, GPQA, and more.' + special: true + - text: Chinese company Alibaba releases the Qwen 2.5 model in + various sizes, ranging from 0.5B to 72B. The models demonstrate capabilities + comparable to much larger models. + - text: The video generation model KLING 1.5 has been released. + - text: OpenAI launches the advanced voice mode + of GPT4o for all subscribers. + - text: Meta releases Llama 3.2 in sizes 1B, + 3B, 11B, and 90B, featuring image recognition capabilities for the first time. + - text: Google has rolled out new model updates ready for deployment, + Gemini Pro 1.5 002 and Gemini Flash 1.5 002, + showcasing significantly improved longcontext processing. + - text: Kyutai releases two opensource versions of its voicetovoice + model, Moshi. + - text: Google releases an update to its AI tool NotebookLM that + enables users to create podcasts based on their own content. + - text: Mistral AI launches a 22B model named Mistral Small. + - date: October + info: + - text: Flux 1.1 Pro is released, showcasing advanced capabilities + for image creation. + - text: Meta unveils Movie Gen, a new AI model that generates + videos, images, and audio from text input. + - text: Pika introduces Video Model 1.5 along with "Pika Effects." + - text: Adobe announces its video creation model, Firefly Video. + - text: Startup Rhymes AI releases Aria, an opensource, multimodal + model exhibiting capabilities similar to comparably sized proprietary models. + - text: Meta releases an opensource speechtospeech language model named Meta + Spirit LM. + - text: Mistral AI introduces Ministral, a new model available + in 3B and 8B parameter sizes. + - text: Janus AI, a multimodal language model capable of recognizing + and generating both text and images, is released as open source by DeepSeekAI. + - text: Google DeepMind and MIT unveil Fluid, a texttoimage generation + model with industryleading performance at a scale of 10.5B parameters. + - text: Stable Diffusion 3.5 is released in three sizes as open + source. + - text: Anthropic launches Claude 3.5 Sonnet New, demonstrating + significant advancements in specific areas over its previous version, and + announces Claude 3.5 Haiku. + - text: Anthropic announces an experimental feature for computer use with a public + beta API. + - text: The texttoimage model Recraft v3 has been released to + the public, ranking first in benchmarks compared to similar models. + - text: OpenAI has launched Search GPT, allowing users to perform + web searches directly within the platform. + - date: November + info: + - text: Alibaba released its new model, QwQ 32B Preview, + which integrates reasoning capabilities before responding. The model competes + with, and sometimes surpasses, OpenAI's o1-preview model. + - text: Alibaba opensourced the model Qwen2.5 Coder 32B, + which offers comparable capabilities to leading proprietary language models + in the coding domain. + - text: DeepSeek unveiled its new AI model, DeepSeek-R1-Lite-Preview, + which incorporates reasoning capabilities and delivers impressive performance + on the AIME and MATH benchmarks, matching + the level of OpenAI's o1-preview. + - text: Suno upgraded its AIpowered music generator to v4, + introducing new features and performance improvements. + - text: Mistral AI launched the Pixtral Large + model, a multimodal language model excelling in image recognition and advanced + performance metrics. + - text: Google introduced two experimental models, gemini-exp-1114 + and gemini-exp-1121, currently leading the chatbot landscape + with enhanced performance. diff --git a/_includes/footer.html b/_includes/footer.html new file mode 100644 index 0000000..4b315f6 --- /dev/null +++ b/_includes/footer.html @@ -0,0 +1,19 @@ +
Midjourney v1
+ +Midjourney v2
+ +DALL-E 2 is announced for gradual release.
+ +Midjourney v3 is launched.
+ +Stable Diffusion 1.4 is released.
+ +Stable Diffusion 1.5 becomes available.
+ +Midjourney v4 is released.
+ +Stable Diffusion 2.0 is launched.
+ +ChatGPT 3.5, a large language model by OpenAI, is released to the public and quickly becomes a viral sensation.
+ +Stable Diffusion 2.1 is released.
+ +Meta releases the LLaMA language model as open-source for research purposes. The model is later leaked.
+ +Microsoft gradually releases Bing AI, an AI chat based on an upgraded GPT model integrating internet search.
+ +Midjourney v5 is launched.
+ +OpenAI's GPT-4 model is partially released, featuring multimodal image analysis and improved multi-language support.
+ +Google releases the AI chat Bard in a limited capacity, based on the LaMDA language model.
+ +Adobe releases the Firefly image creation model as a beta version to a waiting list. The model allowed a variety of capabilities including text formatting.
+ +Midjourney v5.1 is released.
+ +Google announces an upgrade to Bard, moving it to the upgraded PaLM 2 language model. It will support 180 countries and many languages.
+ +Midjourney v5.2 is launched.
+ +Stable Diffusion XL 1.0 is released.
+ +Anthropic announces a new version of their large language model - Claude 2.
+ +Meta releases the LLaMA 2 open source language model to the general public in a variety of sizes.
+ +DALL-E 3 is released.
+ +Adobe releases Firefly 2.
+ +Stable Diffusion XL Turbo is released - A fast model that allows the creation of an image in one step in real time.
+ +Midjourney v6 is launched.
+ +Google upgrades Bard in limited areas, moving it to be based on the upgraded Gemini Pro language model.
+ +X Corporation launches Grok AI chatbot for paid subscribers in English language.
+ +Stability AI announces Stable Diffusion 3 (gradually released to waiting list).
+ +Google upgrades the artificial intelligence chat in Bard, basing it on the new Gemini Pro model, in all available languages. Google replaces "Bard" with "Gemini".
+ +Google announces the Gemini Pro 1.5 multimodal language model capable of parsing up to a million tokens, as well as parsing video and images. The model is gradually released to developers on a waiting list.
+ +OpenAI announces the Sora model that produces videos up to a minute long. The model is not released to the public at this time.
+ +X Corporation announces the upcoming release of the Grok 1.5 open source model.
+ +Anthropic announces Claude 3, a new version of their large language model. The version is deployed in 3 different sizes, with the largest model performing better than GPT-4.
+ +Suno AI, which develops a model for creating music, releases Suno v3 to the general public.
+ +Stability AI releases a new update to the music creation model - Stable Audio 2.0.
+ +X Corporation releases an upgrade to its language model, Grok-1.5V, which integrates high-level image recognition. In the test presented by the company, the model is the best in identifying and analyzing images compared to other models.
+ +The Mistral company releases its new model Mixtral 8x22B as open source. This is the most powerful model among the open source models and it contains 141 billion parameters but uses a method that allows more economical use.
+ +Meta releases the LLaMA 3 model as open source in sizes 8B and 70B parameters. The large model shows better performance than Claude 3 Sonnet and Gemini Pro 1.5 in several measures. Meta is expected to later release larger models with 400 billion parameters and more.
+ +Microsoft releases the Phi-3-mini model in open source. The model comes in a reduced version of 3.8B parameters, which allows it to run on mobile devices as well, and it presents capabilities similar to GPT-3.5.
+ +Adobe announces its new image creation model Firefly 3.
+ +The startup Reka AI presents a series of multimodal language models in 3 sizes. The models are capable of processing video, audio and images. The large model featured similar capabilities to GPT-4.
+ +Apple releases as full open source a series of small language models under the name OpenELM. The models are available in four weights between 270 million and 3 billion parameters.
+ +OpenAI announces the GPT-4-O model that presents full multimodal capabilities, including receiving and creating text, images, and audio. The model presents an impressive ability to speak with a high response speed and in natural language. The model is 2 times more efficient than the GPT-4 Turbo model, and has better capabilities for languages other than English.
+ +Google announces a large number of AI features in its products. The main ones: increasing the token limit to 2 million for Gemini 1.5 to waiting list, releasing a smaller and faster Gemini Flash 1.5 model. Revealing the latest image creation model Imagen 3, music creation model Music AI and video creation model Veo. And the announcement of the Astra model with multimodal capabilities for realtime audio and video reception.
+ +Microsoft announces Copilot+ for dedicated computers, which will allow a full search of the user's history through screenshots of the user's activity. The company also released as open source the SLMs that display impressive capabilities in a minimal size: Phi-3 Small, Phi-3 Medium, and Phi-3 Vision which includes image recognition capability.
+ +Meta introduces Chameleon, a new multimodal model that seamlessly renders text and images.
+ +Mistral AI releases a new open source version of its language model Mistral-7B-Instruct-v0.3.
+ +Google announces AI Overviews intended to give a summary of the relevant information in Google search.
+ +Suno AI releases an updated music creation model Suno v3.5.
+ +Mistral AI releases a new language model designed for coding Codestral in size 22B.
+ +Stability AI releases its updated image creation model Stable Diffusion 3 in a medium version in size 2B parameters.
+ +Apple announces Apple Intelligence, an AI system that will be integrated into the company's devices and will combine AI models of different sizes for different tasks.
+ +DeepSeekAI publishes the DeepSeekCoderV2 open source language model which presents similar coding capabilities to models such as GPT-4, Claude 3 Opus and more.
+ +Runway introduces Gen3 Alpha, a new AI model for video generation.
+ +Anthropic releases the Claude Sonnet 3.5 model, which presents better capabilities than other models with low resource usage.
+ +Microsoft releases in open source a series of image recognition models called Florence 2.
+ +Google announces Gemma 2 open source language models with 9B and 27B parameter sizes. Also, the company opens the context window capabilities to developers for up to 2 million tokens.
+ +OpenAI has released a miniaturized model called gpt4o mini that presents high capabilities at a low cost
+ +Meta releases as open source the llama 3.1 model in sizes 8B 70B and 405B. The large model features the same capabilities as the best closed source models
+ +mistral ai releases three new models: Codestral Mamba, Mistral NeMo and Mathstral designed for mathematics
+ +Google DeepMind has unveiled two new AI systems that won silver medals at this year's International Mathematical Olympiad (IMO), AlphaProof and AlphaGeometry 2.
+ +OpenAI launched SearchGPT, an integrated web search
+ +Startup Udio has released Udio v1.5, an updated version of its music creation model
+ +mistral ai has released a large language model Mistral Large 2 in size 123B, which presents capabilities close to the closed SOTA models.
+ +Midjourney v6.1 is released
+ +Google releases the Gemma 2 2B model as open source. The model demonstrates better capabilities than much larger models.
+ +"Black Forest Labs" releases weights for an image creation model named Flux, which shows better performance than similar closedsource models.
+ +OpenAI released a new version of its model, gpt4o 0806, achieving 100% success in generating valid JSON output.
+ +Google's image generation model, Imagen 3, has been released.
+ +xAI Corporation has launched the models Grok 2 and Grok 2 mini, which demonstrate performance on par with leading SOTA models in the market.
+ +Microsoft has introduced its small language models, Phi 3.5, in three versions, each showcasing impressive performance relative to their size.
+ +Google has introduced three new experimental AI models: Gemini 1.5 Flash8B, Gemini 1.5 Pro Enhanced, and Gemini 1.5 Flash Updated.
+ +Ideogram 2.0 has been released, offering image generation capabilities that surpass those of other leading models.
+ +Luma has unveiled the Dream Machine 1.5 model for video creation.
+ +The French AI company Mistral has introduced Pixtral12B, its first multimodal model capable of processing both images and text.
+ +OPENAI has released two nextgeneration AI models to its subscribers: o1 preview and o1 mini. These models show a significant improvement in performance, particularly in tasks requiring reasoning, including coding, mathematics, GPQA, and more.
+ +Chinese company Alibaba releases the Qwen 2.5 model in various sizes, ranging from 0.5B to 72B. The models demonstrate capabilities comparable to much larger models.
+ +The video generation model KLING 1.5 has been released.
+ +OpenAI launches the advanced voice mode of GPT4o for all subscribers.
+ +Meta releases Llama 3.2 in sizes 1B, 3B, 11B, and 90B, featuring image recognition capabilities for the first time.
+ +Google has rolled out new model updates ready for deployment, Gemini Pro 1.5 002 and Gemini Flash 1.5 002, showcasing significantly improved longcontext processing.
+ +Kyutai releases two opensource versions of its voicetovoice model, Moshi.
+ +Google releases an update to its AI tool NotebookLM that enables users to create podcasts based on their own content.
+ +Mistral AI launches a 22B model named Mistral Small.
+ +Flux 1.1 Pro is released, showcasing advanced capabilities for image creation.
+ +Meta unveils Movie Gen, a new AI model that generates videos, images, and audio from text input.
+ +Pika introduces Video Model 1.5 along with "Pika Effects."
+ +Adobe announces its video creation model, Firefly Video.
+ +Startup Rhymes AI releases Aria, an opensource, multimodal model exhibiting capabilities similar to comparably sized proprietary models.
+ +Meta releases an opensource speechtospeech language model named Meta Spirit LM.
+ +Mistral AI introduces Ministral, a new model available in 3B and 8B parameter sizes.
+ +Janus AI, a multimodal language model capable of recognizing and generating both text and images, is released as open source by DeepSeekAI.
+ +Google DeepMind and MIT unveil Fluid, a texttoimage generation model with industryleading performance at a scale of 10.5B parameters.
+ +Stable Diffusion 3.5 is released in three sizes as open source.
+ +Anthropic launches Claude 3.5 Sonnet New, demonstrating significant advancements in specific areas over its previous version, and announces Claude 3.5 Haiku.
+ +Anthropic announces an experimental feature for computer use with a public beta API.
+ +The texttoimage model Recraft v3 has been released to the public, ranking first in benchmarks compared to similar models.
+ +OpenAI has launched Search GPT, allowing users to perform web searches directly within the platform.
+ +Alibaba released its new model, QwQ 32B Preview, which integrates reasoning capabilities before responding. The model competes with, and sometimes surpasses, OpenAI's o1-preview model.
+ +Alibaba opensourced the model Qwen2.5 Coder 32B, which offers comparable capabilities to leading proprietary language models in the coding domain.
+ +DeepSeek unveiled its new AI model, DeepSeek-R1-Lite-Preview, which incorporates reasoning capabilities and delivers impressive performance on the AIME and MATH benchmarks, matching the level of OpenAI's o1-preview.
+ +Suno upgraded its AIpowered music generator to v4, introducing new features and performance improvements.
+ +Mistral AI launched the Pixtral Large model, a multimodal language model excelling in image recognition and advanced performance metrics.
+ +Google introduced two experimental models, gemini-exp-1114 and gemini-exp-1121, currently leading the chatbot landscape with enhanced performance.
+ +{event}
') - else: - html.append(f'\t{event}
') - - # Close the last article if exists - if current_article: - html.append('Alibaba released its new model, QwQ32BPreview, which integrates reasoning capabilities before responding. The model competes with, and sometimes surpasses, OpenAI's o1preview model.
-Alibaba opensourced the model Qwen2.5 Coder 32B, which offers comparable capabilities to leading proprietary language models in the coding domain.
-DeepSeek unveiled its new AI model, DeepSeekR1LitePreview, which incorporates reasoning capabilities and delivers impressive performance on the AIME and MATH benchmarks, matching the level of OpenAI's o1preview.
-Suno upgraded its AIpowered music generator to v4, introducing new features and performance improvements.
-Mistral AI launched the Pixtral Large model, a multimodal language model excelling in image recognition and advanced performance metrics.
-Google introduced two experimental models, geminiexp1114 and geminiexp1121, currently leading the chatbot landscape with enhanced performance.
-Midjourney v1
-Midjourney v2
-DALL-E 2 is announced for gradual release.
-Midjourney v3 is launched.
-Stable Diffusion 1.4 is released.
-Stable Diffusion 1.5 becomes available.
-Midjourney v4 is released.
-Stable Diffusion 2.0 is launched.
-ChatGPT 3.5, a large language model by OpenAI, is released to the public and quickly becomes a viral sensation.
-Stable Diffusion 2.1 is released.
-Meta releases the LLaMA language model as open-source for research purposes. The model is later leaked.
-Microsoft gradually releases Bing AI, an AI chat based on an upgraded GPT model integrating internet search.
-Midjourney v5 is launched.
-OpenAI's GPT-4 model is partially released, featuring multimodal image analysis and improved multi-language support.
-Google releases the AI chat Bard in a limited capacity, based on the LaMDA language model.
-Adobe releases the Firefly image creation model as a beta version to a waiting list. The model allowed a variety of capabilities including text formatting.
-Midjourney v5.1 is released.
-Google announces an upgrade to Bard, moving it to the upgraded PaLM 2 language model. It will support 180 countries and many languages.
-Midjourney v5.2 is launched.
-Stable Diffusion XL 1.0 is released.
-Anthropic announces a new version of their large language model - Claude 2.
-Meta releases the LLaMA 2 open source language model to the general public in a variety of sizes.
-DALL-E 3 is released.
-Adobe releases Firefly 2.
-Stable Diffusion XL Turbo is released - A fast model that allows the creation of an image in one step in real time.
-Midjourney v6 is launched.
-Google upgrades Bard in limited areas, moving it to be based on the upgraded Gemini Pro language model.
-X Corporation launches Grok AI chatbot for paid subscribers in English language.
-Stability AI announces Stable Diffusion 3 (gradually released to waiting list).
-Google upgrades the artificial intelligence chat in Bard, basing it on the new Gemini Pro model, in all available languages. Google replaces "Bard" with "Gemini".
-Google announces the Gemini Pro 1.5 multimodal language model capable of parsing up to a million tokens, as well as parsing video and images. The model is gradually released to developers on a waiting list.
-OpenAI announces the Sora model that produces videos up to a minute long. The model is not released to the public at this time.
-X Corporation announces the upcoming release of the Grok 1.5 open source model.
-Anthropic announces Claude 3, a new version of their large language model. The version is deployed in 3 different sizes, with the largest model performing better than GPT-4.
-Suno AI, which develops a model for creating music, releases Suno v3 to the general public.
-Stability AI releases a new update to the music creation model - Stable Audio 2.0.
-X Corporation releases an upgrade to its language model, Grok-1.5V, which integrates high-level image recognition. In the test presented by the company, the model is the best in identifying and analyzing images compared to other models.
-The Mistral company releases its new model Mixtral 8x22B as open source. This is the most powerful model among the open source models and it contains 141 billion parameters but uses a method that allows more economical use.
-Meta releases the LLaMA 3 model as open source in sizes 8B and 70B parameters. The large model shows better performance than Claude 3 Sonnet and Gemini Pro 1.5 in several measures. Meta is expected to later release larger models with 400 billion parameters and more.
-Microsoft releases the Phi-3-mini model in open source. The model comes in a reduced version of 3.8B parameters, which allows it to run on mobile devices as well, and it presents capabilities similar to GPT-3.5.
-Adobe announces its new image creation model Firefly 3.
-The startup Reka AI presents a series of multimodal language models in 3 sizes. The models are capable of processing video, audio and images. The large model featured similar capabilities to GPT-4.
-Apple releases as full open source a series of small language models under the name OpenELM. The models are available in four weights between 270 million and 3 billion parameters.
-OpenAI announces the GPT-4-O model that presents full multimodal capabilities, including receiving and creating text, images, and audio. The model presents an impressive ability to speak with a high response speed and in natural language. The model is 2 times more efficient than the GPT-4 Turbo model, and has better capabilities for languages other than English.
-Google announces a large number of AI features in its products. The main ones: increasing the token limit to 2 million for Gemini 1.5 to waiting list, releasing a smaller and faster Gemini Flash 1.5 model. Revealing the latest image creation model Imagen 3, music creation model Music AI and video creation model Veo. And the announcement of the Astra model with multimodal capabilities for realtime audio and video reception.
-Microsoft announces Copilot+ for dedicated computers, which will allow a full search of the user's history through screenshots of the user's activity. The company also released as open source the SLMs that display impressive capabilities in a minimal size: Phi-3 Small, Phi-3 Medium, and Phi-3 Vision which includes image recognition capability.
-Meta introduces Chameleon, a new multimodal model that seamlessly renders text and images.
-Mistral AI releases a new open source version of its language model Mistral-7B-Instruct-v0.3.
-Google announces AI Overviews intended to give a summary of the relevant information in Google search.
-Suno AI releases an updated music creation model Suno v3.5.
-Mistral AI releases a new language model designed for coding Codestral in size 22B.
-Stability AI releases its updated image creation model Stable Diffusion 3 in a medium version in size 2B parameters.
-Apple announces Apple Intelligence, an AI system that will be integrated into the company's devices and will combine AI models of different sizes for different tasks.
-DeepSeekAI publishes the DeepSeekCoderV2 open source language model which presents similar coding capabilities to models such as GPT-4, Claude 3 Opus and more.
-Runway introduces Gen3 Alpha, a new AI model for video generation.
-Anthropic releases the Claude Sonnet 3.5 model, which presents better capabilities than other models with low resource usage.
-Microsoft releases in open source a series of image recognition models called Florence 2.
-Google announces Gemma 2 open source language models with 9B and 27B parameter sizes. Also, the company opens the context window capabilities to developers for up to 2 million tokens.
-OpenAI has released a miniaturized model called gpt4o mini that presents high capabilities at a low cost
-Meta releases as open source the llama 3.1 model in sizes 8B 70B and 405B. The large model features the same capabilities as the best closed source models
-mistral ai releases three new models: Codestral Mamba, Mistral NeMo and Mathstral designed for mathematics
-Google DeepMind has unveiled two new AI systems that won silver medals at this year's International Mathematical Olympiad (IMO), AlphaProof and AlphaGeometry 2.
-OpenAI launched SearchGPT, an integrated web search
-Startup Udio has released Udio v1.5, an updated version of its music creation model
-mistral ai has released a large language model Mistral Large 2 in size 123B, which presents capabilities close to the closed SOTA models.
-Midjourney v6.1 is released
-Google releases the Gemma 2 2B model as open source. The model demonstrates better capabilities than much larger models.
-"Black Forest Labs" releases weights for an image creation model named Flux, which shows better performance than similar closedsource models.
-OpenAI released a new version of its model, gpt4o 0806, achieving 100% success in generating valid JSON output.
-Google's image generation model, Imagen 3, has been released.
-xAI Corporation has launched the models Grok 2 and Grok 2 mini, which demonstrate performance on par with leading SOTA models in the market.
-Microsoft has introduced its small language models, Phi 3.5, in three versions, each showcasing impressive performance relative to their size.
-Google has introduced three new experimental AI models: Gemini 1.5 Flash8B, Gemini 1.5 Pro Enhanced, and Gemini 1.5 Flash Updated.
-Ideogram 2.0 has been released, offering image generation capabilities that surpass those of other leading models.
-Luma has unveiled the Dream Machine 1.5 model for video creation.
-The French AI company Mistral has introduced Pixtral12B, its first multimodal model capable of processing both images and text.
-OPENAI has released two nextgeneration AI models to its subscribers: o1 preview and o1 mini. These models show a significant improvement in performance, particularly in tasks requiring reasoning, including coding, mathematics, GPQA, and more.
-Chinese company Alibaba releases the Qwen 2.5 model in various sizes, ranging from 0.5B to 72B. The models demonstrate capabilities comparable to much larger models.
-The video generation model KLING 1.5 has been released.
-OpenAI launches the advanced voice mode of GPT4o for all subscribers.
-Meta releases Llama 3.2 in sizes 1B, 3B, 11B, and 90B, featuring image recognition capabilities for the first time.
-Google has rolled out new model updates ready for deployment, Gemini Pro 1.5 002 and Gemini Flash 1.5 002, showcasing significantly improved longcontext processing.
-Kyutai releases two opensource versions of its voicetovoice model, Moshi.
-Google releases an update to its AI tool NotebookLM that enables users to create podcasts based on their own content.
-Mistral AI launches a 22B model named Mistral Small.
-Flux 1.1 Pro is released, showcasing advanced capabilities for image creation.
-Meta unveils Movie Gen, a new AI model that generates videos, images, and audio from text input.
-Pika introduces Video Model 1.5 along with "Pika Effects."
-Adobe announces its video creation model, Firefly Video.
-Startup Rhymes AI releases Aria, an opensource, multimodal model exhibiting capabilities similar to comparably sized proprietary models.
-Meta releases an opensource speechtospeech language model named Meta Spirit LM.
-Mistral AI introduces Ministral, a new model available in 3B and 8B parameter sizes.
-Janus AI, a multimodal language model capable of recognizing and generating both text and images, is released as open source by DeepSeekAI.
-Google DeepMind and MIT unveil Fluid, a texttoimage generation model with industryleading performance at a scale of 10.5B parameters.
-Stable Diffusion 3.5 is released in three sizes as open source.
-Anthropic launches Claude 3.5 Sonnet New, demonstrating significant advancements in specific areas over its previous version, and announces Claude 3.5 Haiku.
-Anthropic announces an experimental feature for computer use with a public beta API.
-The texttoimage model Recraft v3 has been released to the public, ranking first in benchmarks compared to similar models.
-OpenAI has launched Search GPT, allowing users to perform web searches directly within the platform.
-Alibaba released its new model, QwQ 32B Preview, which integrates reasoning capabilities before responding. The model competes with, and sometimes surpasses, OpenAI's o1-preview model.
-Alibaba opensourced the model Qwen2.5 Coder 32B, which offers comparable capabilities to leading proprietary language models in the coding domain.
-DeepSeek unveiled its new AI model, DeepSeek-R1-Lite-Preview, which incorporates reasoning capabilities and delivers impressive performance on the AIME and MATH benchmarks, matching the level of OpenAI's o1-preview.
-Suno upgraded its AIpowered music generator to v4, introducing new features and performance improvements.
-Mistral AI launched the Pixtral Large model, a multimodal language model excelling in image recognition and advanced performance metrics.
-Google introduced two experimental models, gemini-exp-1114 and gemini-exp-1121, currently leading the chatbot landscape with enhanced performance.
-{{ info.text }}
+ {% endfor %} +