diff --git a/Gemfile b/Gemfile new file mode 100644 index 0000000..d31a08b --- /dev/null +++ b/Gemfile @@ -0,0 +1,2 @@ +source "https://rubygems.org" +gem "jekyll" \ No newline at end of file diff --git a/Gemfile.lock b/Gemfile.lock new file mode 100644 index 0000000..254b08d --- /dev/null +++ b/Gemfile.lock @@ -0,0 +1,74 @@ +GEM + remote: https://rubygems.org/ + specs: + addressable (2.8.7) + public_suffix (>= 2.0.2, < 7.0) + bigdecimal (3.1.8) + colorator (1.1.0) + concurrent-ruby (1.3.4) + em-websocket (0.5.3) + eventmachine (>= 0.12.9) + http_parser.rb (~> 0) + eventmachine (1.2.7) + ffi (1.17.0-x64-mingw-ucrt) + forwardable-extended (2.6.0) + google-protobuf (4.29.0-x64-mingw-ucrt) + bigdecimal + rake (>= 13) + http_parser.rb (0.8.0) + i18n (1.14.6) + concurrent-ruby (~> 1.0) + jekyll (4.3.4) + addressable (~> 2.4) + colorator (~> 1.0) + em-websocket (~> 0.5) + i18n (~> 1.0) + jekyll-sass-converter (>= 2.0, < 4.0) + jekyll-watch (~> 2.0) + kramdown (~> 2.3, >= 2.3.1) + kramdown-parser-gfm (~> 1.0) + liquid (~> 4.0) + mercenary (>= 0.3.6, < 0.5) + pathutil (~> 0.9) + rouge (>= 3.0, < 5.0) + safe_yaml (~> 1.0) + terminal-table (>= 1.8, < 4.0) + webrick (~> 1.7) + jekyll-sass-converter (3.0.0) + sass-embedded (~> 1.54) + jekyll-watch (2.2.1) + listen (~> 3.0) + kramdown (2.5.1) + rexml (>= 3.3.9) + kramdown-parser-gfm (1.1.0) + kramdown (~> 2.0) + liquid (4.0.4) + listen (3.9.0) + rb-fsevent (~> 0.10, >= 0.10.3) + rb-inotify (~> 0.9, >= 0.9.10) + mercenary (0.4.0) + pathutil (0.16.2) + forwardable-extended (~> 2.6) + public_suffix (6.0.1) + rake (13.2.1) + rb-fsevent (0.11.2) + rb-inotify (0.11.1) + ffi (~> 1.0) + rexml (3.3.9) + rouge (4.5.1) + safe_yaml (1.0.5) + sass-embedded (1.81.0-x64-mingw-ucrt) + google-protobuf (~> 4.28) + terminal-table (3.0.2) + unicode-display_width (>= 1.1.1, < 3) + unicode-display_width (2.6.0) + webrick (1.9.0) + +PLATFORMS + x64-mingw-ucrt + +DEPENDENCIES + jekyll + +BUNDLED WITH + 2.5.23 diff --git a/_data/links.yml b/_data/links.yml new file mode 100644 index 0000000..8a718cc --- /dev/null +++ b/_data/links.yml @@ -0,0 +1,19 @@ +- text: The Latest AI Innovations + url: https://www.futurepedia.io/ai-innovations +- text: Artificial Intelligence Index + url: https://aiindex.stanford.edu +- text: 'Generative AI Timeline: 9 Decades of Notable Milestones' + url: https://www.cmswire.com/digital-experience/generative-ai-timeline-9-decades-of-notable-milestones/ +- text: The History of Artificial Intelligence - Harvard + url: https://sitn.hms.harvard.edu/flash/2017/history-artificial-intelligence/ +- text: Analyzing 2023's Milestones and Forecasting 2024's Trends + url: https://masterofcode.com/blog/ai-highlights-2024 +- text: Timeline of Artificial Intelligence - Wikipedia + url: https://en.wikipedia.org/wiki/Timeline_of_artificial_intelligence +- text: The History of Artificial Intelligence - Complete AI Timeline + url: https://www.techtarget.com/searchenterpriseai/tip/The-history-of-artificial-intelligence-Complete-AI-timeline +- text: 'From the World Wide Web to AI: 11 Technology Milestones That Changed Our + Lives' + url: https://www.weforum.org/agenda/2024/03/11-technology-milestones-ai-quantum-computing-vr/ +- text: 'Artificial Intelligence (AI) and ChatGPT: History and Timelines' + url: https://www.officetimeline.com/blog/artificial-intelligence-ai-and-chatgpt-history-and-timelines diff --git a/_data/timeline.yml b/_data/timeline.yml new file mode 100644 index 0000000..6619ed2 --- /dev/null +++ b/_data/timeline.yml @@ -0,0 +1,302 @@ +- year: 2022 + events: + - date: February + info: + - text: Midjourney v1 + - date: April + info: + - text: Midjourney v2 + - text: DALL-E 2 is announced for gradual release. + special: true + - date: July + info: + - text: Midjourney v3 is launched. + - date: August + info: + - text: Stable Diffusion 1.4 is released. + - date: October + info: + - text: Stable Diffusion 1.5 becomes available. + - date: November + info: + - text: Midjourney v4 is released. + - text: Stable Diffusion 2.0 is launched. + - text: ChatGPT 3.5, a large language model by OpenAI, is released + to the public and quickly becomes a viral sensation. + special: true + - date: December + info: + - text: Stable Diffusion 2.1 is released. +- year: 2023 + events: + - date: February + info: + - text: Meta releases the LLaMA language model as open-source + for research purposes. The model is later leaked. + special: true + - text: Microsoft gradually releases Bing AI, an AI chat based + on an upgraded GPT model integrating internet search. + - date: March + info: + - text: Midjourney v5 is launched. + - text: OpenAI's GPT-4 model is partially released, featuring + multimodal image analysis and improved multi-language support. + - text: Google releases the AI chat Bard in a limited capacity, + based on the LaMDA language model. + - date: April + info: + - text: Adobe releases the Firefly image creation model as a + beta version to a waiting list. The model allowed a variety of capabilities + including text formatting. + - date: May + info: + - text: Midjourney v5.1 is released. + - text: Google announces an upgrade to Bard, moving it to the upgraded PaLM + 2 language model. It will support 180 countries and many languages. + - date: June + info: + - text: Midjourney v5.2 is launched. + - date: July + info: + - text: Stable Diffusion XL 1.0 is released. + - text: Anthropic announces a new version of their large language model - Claude + 2. + - text: Meta releases the LLaMA 2 open source language model + to the general public in a variety of sizes. + - date: October + info: + - text: DALL-E 3 is released. + - text: Adobe releases Firefly 2. + - date: November + info: + - text: Stable Diffusion XL Turbo is released - A fast model + that allows the creation of an image in one step in real time. + - date: December + info: + - text: Midjourney v6 is launched. + - text: Google upgrades Bard in limited areas, moving it to be based on the upgraded + Gemini Pro language model. + - text: X Corporation launches Grok AI chatbot for paid subscribers + in English language. +- year: 2024 + events: + - date: February + info: + - text: Stability AI announces Stable Diffusion 3 (gradually + released to waiting list). + - text: Google upgrades the artificial intelligence chat in Bard, basing it on + the new Gemini Pro model, in all available languages. Google + replaces "Bard" with "Gemini". + - text: Google announces the Gemini Pro 1.5 multimodal language + model capable of parsing up to a million tokens, as well as parsing video + and images. The model is gradually released to developers on a waiting list. + special: true + - text: OpenAI announces the Sora model that produces videos + up to a minute long. The model is not released to the public at this time. + special: true + - date: March + info: + - text: X Corporation announces the upcoming release of the Grok 1.5 + open source model. + - text: Anthropic announces Claude 3, a new version of their + large language model. The version is deployed in 3 different sizes, with the + largest model performing better than GPT-4. + - text: Suno AI, which develops a model for creating music, releases Suno + v3 to the general public. + - date: April + info: + - text: Stability AI releases a new update to the music creation model - Stable + Audio 2.0. + - text: X Corporation releases an upgrade to its language model, Grok-1.5V, + which integrates high-level image recognition. In the test presented by the + company, the model is the best in identifying and analyzing images compared + to other models. + - text: The Mistral company releases its new model Mixtral 8x22B + as open source. This is the most powerful model among the open source models + and it contains 141 billion parameters but uses a method that allows more + economical use. + - text: Meta releases the LLaMA 3 model as open source in sizes + 8B and 70B parameters. The large model shows better performance than Claude + 3 Sonnet and Gemini Pro 1.5 in several measures. Meta is expected to later + release larger models with 400 billion parameters and more. + - text: Microsoft releases the Phi-3-mini model in open source. + The model comes in a reduced version of 3.8B parameters, which allows it to + run on mobile devices as well, and it presents capabilities similar to GPT-3.5. + special: true + - text: Adobe announces its new image creation model Firefly 3. + - text: The startup Reka AI presents a series of multimodal language + models in 3 sizes. The models are capable of processing video, audio and images. + The large model featured similar capabilities to GPT-4. + - text: Apple releases as full open source a series of small language models under + the name OpenELM. The models are available in four weights + between 270 million and 3 billion parameters. + - date: May + info: + - text: OpenAI announces the GPT-4-O model that presents full + multimodal capabilities, including receiving and creating text, images, and + audio. The model presents an impressive ability to speak with a high response + speed and in natural language. The model is 2 times more efficient than the + GPT-4 Turbo model, and has better capabilities for languages other than English. + special: true + - text: 'Google announces a large number of AI features in its products. The main + ones: increasing the token limit to 2 million for Gemini 1.5 to waiting list, + releasing a smaller and faster Gemini Flash 1.5 model. Revealing + the latest image creation model Imagen 3, music creation + model Music AI and video creation model Veo. + And the announcement of the Astra model with multimodal capabilities + for realtime audio and video reception.' + - text: 'Microsoft announces Copilot+ for dedicated computers, + which will allow a full search of the user''s history through screenshots + of the user''s activity. The company also released as open source the SLMs + that display impressive capabilities in a minimal size: Phi-3 Small, + Phi-3 Medium, and Phi-3 Vision which includes + image recognition capability.' + - text: Meta introduces Chameleon, a new multimodal model that + seamlessly renders text and images. + - text: Mistral AI releases a new open source version of its language model Mistral-7B-Instruct-v0.3. + - text: Google announces AI Overviews intended to give a summary + of the relevant information in Google search. + special: true + - text: Suno AI releases an updated music creation model Suno v3.5. + - text: Mistral AI releases a new language model designed for coding Codestral + in size 22B. + - date: June + info: + - text: Stability AI releases its updated image creation model Stable + Diffusion 3 in a medium version in size 2B parameters. + - text: Apple announces Apple Intelligence, an AI system that + will be integrated into the company's devices and will combine AI models of + different sizes for different tasks. + - text: DeepSeekAI publishes the DeepSeekCoderV2 open source + language model which presents similar coding capabilities to models such as + GPT-4, Claude 3 Opus and more. + - text: Runway introduces Gen3 Alpha, a new + AI model for video generation. + - text: Anthropic releases the Claude Sonnet 3.5 model, which + presents better capabilities than other models with low resource usage. + special: true + - text: Microsoft releases in open source a series of image recognition models + called Florence 2. + - text: Google announces Gemma 2 open source language models + with 9B and 27B parameter sizes. Also, the company opens the context window + capabilities to developers for up to 2 million tokens. + - date: July + info: + - text: OpenAI has released a miniaturized model called gpt4o mini + that presents high capabilities at a low cost + - text: Meta releases as open source the llama 3.1 model in sizes + 8B 70B and 405B. The large model features the same capabilities as the best + closed source models + special: true + - text: 'mistral ai releases three new models: Codestral Mamba, + Mistral NeMo and Mathstral designed for + mathematics' + - text: Google DeepMind has unveiled two new AI systems that won silver medals + at this year's International Mathematical Olympiad (IMO), AlphaProof + and AlphaGeometry 2. + special: true + - text: OpenAI launched SearchGPT, an integrated web search + - text: Startup Udio has released Udio v1.5, an updated version + of its music creation model + - text: mistral ai has released a large language model Mistral Large + 2 in size 123B, which presents capabilities close to the closed SOTA + models. + special: true + - text: Midjourney v6.1 is released + - text: Google releases the Gemma 2 2B model as open source. + The model demonstrates better capabilities than much larger models. + - date: August + info: + - text: '"Black Forest Labs" releases weights for an image creation model named + Flux, which shows better performance than similar closedsource + models.' + - text: OpenAI released a new version of its model, gpt4o 0806, + achieving 100% success in generating valid JSON output. + - text: Google's image generation model, Imagen 3, has been released. + - text: xAI Corporation has launched the models Grok 2 and Grok + 2 mini, which demonstrate performance on par with leading SOTA models + in the market. + - text: Microsoft has introduced its small language models, Phi 3.5, + in three versions, each showcasing impressive performance relative to their + size. + - text: 'Google has introduced three new experimental AI models: Gemini + 1.5 Flash8B, Gemini 1.5 Pro Enhanced, and Gemini + 1.5 Flash Updated.' + - text: Ideogram 2.0 has been released, offering image generation + capabilities that surpass those of other leading models. + - text: Luma has unveiled the Dream Machine 1.5 model for video + creation. + - date: September + info: + - text: The French AI company Mistral has introduced Pixtral12B, + its first multimodal model capable of processing both images and text. + - text: 'OPENAI has released two nextgeneration AI models to its subscribers: + o1 preview and o1 mini. These models show + a significant improvement in performance, particularly in tasks requiring + reasoning, including coding, mathematics, GPQA, and more.' + special: true + - text: Chinese company Alibaba releases the Qwen 2.5 model in + various sizes, ranging from 0.5B to 72B. The models demonstrate capabilities + comparable to much larger models. + - text: The video generation model KLING 1.5 has been released. + - text: OpenAI launches the advanced voice mode + of GPT4o for all subscribers. + - text: Meta releases Llama 3.2 in sizes 1B, + 3B, 11B, and 90B, featuring image recognition capabilities for the first time. + - text: Google has rolled out new model updates ready for deployment, + Gemini Pro 1.5 002 and Gemini Flash 1.5 002, + showcasing significantly improved longcontext processing. + - text: Kyutai releases two opensource versions of its voicetovoice + model, Moshi. + - text: Google releases an update to its AI tool NotebookLM that + enables users to create podcasts based on their own content. + - text: Mistral AI launches a 22B model named Mistral Small. + - date: October + info: + - text: Flux 1.1 Pro is released, showcasing advanced capabilities + for image creation. + - text: Meta unveils Movie Gen, a new AI model that generates + videos, images, and audio from text input. + - text: Pika introduces Video Model 1.5 along with "Pika Effects." + - text: Adobe announces its video creation model, Firefly Video. + - text: Startup Rhymes AI releases Aria, an opensource, multimodal + model exhibiting capabilities similar to comparably sized proprietary models. + - text: Meta releases an opensource speechtospeech language model named Meta + Spirit LM. + - text: Mistral AI introduces Ministral, a new model available + in 3B and 8B parameter sizes. + - text: Janus AI, a multimodal language model capable of recognizing + and generating both text and images, is released as open source by DeepSeekAI. + - text: Google DeepMind and MIT unveil Fluid, a texttoimage generation + model with industryleading performance at a scale of 10.5B parameters. + - text: Stable Diffusion 3.5 is released in three sizes as open + source. + - text: Anthropic launches Claude 3.5 Sonnet New, demonstrating + significant advancements in specific areas over its previous version, and + announces Claude 3.5 Haiku. + - text: Anthropic announces an experimental feature for computer use with a public + beta API. + - text: The texttoimage model Recraft v3 has been released to + the public, ranking first in benchmarks compared to similar models. + - text: OpenAI has launched Search GPT, allowing users to perform + web searches directly within the platform. + - date: November + info: + - text: Alibaba released its new model, QwQ 32B Preview, + which integrates reasoning capabilities before responding. The model competes + with, and sometimes surpasses, OpenAI's o1-preview model. + - text: Alibaba opensourced the model Qwen2.5 Coder 32B, + which offers comparable capabilities to leading proprietary language models + in the coding domain. + - text: DeepSeek unveiled its new AI model, DeepSeek-R1-Lite-Preview, + which incorporates reasoning capabilities and delivers impressive performance + on the AIME and MATH benchmarks, matching + the level of OpenAI's o1-preview. + - text: Suno upgraded its AIpowered music generator to v4, + introducing new features and performance improvements. + - text: Mistral AI launched the Pixtral Large + model, a multimodal language model excelling in image recognition and advanced + performance metrics. + - text: Google introduced two experimental models, gemini-exp-1114 + and gemini-exp-1121, currently leading the chatbot landscape + with enhanced performance. diff --git a/_includes/footer.html b/_includes/footer.html new file mode 100644 index 0000000..4b315f6 --- /dev/null +++ b/_includes/footer.html @@ -0,0 +1,19 @@ +
+ + +
\ No newline at end of file diff --git a/_layouts/default.html b/_layouts/default.html new file mode 100644 index 0000000..baad9d9 --- /dev/null +++ b/_layouts/default.html @@ -0,0 +1,19 @@ + + + + + + + + + + {{ page.title }} + + + + + + {{ content }} + + + \ No newline at end of file diff --git a/_site/LICENSE b/_site/LICENSE new file mode 100644 index 0000000..ab13e36 --- /dev/null +++ b/_site/LICENSE @@ -0,0 +1,21 @@ +MIT License + +Copyright (c) 2024 NHLOCAL + +Permission is hereby granted, free of charge, to any person obtaining a copy +of this software and associated documentation files (the "Software"), to deal +in the Software without restriction, including without limitation the rights +to use, copy, modify, merge, publish, distribute, sublicense, and/or sell +copies of the Software, and to permit persons to whom the Software is +furnished to do so, subject to the following conditions: + +The above copyright notice and this permission notice shall be included in all +copies or substantial portions of the Software. + +THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR +IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, +FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE +AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER +LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, +OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE +SOFTWARE. diff --git a/_site/README.md b/_site/README.md new file mode 100644 index 0000000..22d36c9 --- /dev/null +++ b/_site/README.md @@ -0,0 +1,36 @@ +# AiTimeline + +Welcome to the Artificial Intelligence Timeline, showcasing the evolution and advancements in artificial intelligence technologies from 2022 to 2024. This timeline highlights key milestones, releases, and developments from leading companies and projects in the AI field. + +## Overview + +This timeline is structured chronologically, organized by year, and includes significant events, releases, and updates in the artificial intelligence landscape. From the introduction of new models and updates to open-source releases and major announcements, this timeline provides a comprehensive overview of the dynamic AI industry's progress over the years. + +## Features + +- **Year-wise Organization:** Events and releases are grouped by year for easy navigation and understanding. +- **Detailed Descriptions:** Each event includes a brief description, providing context and details about the milestones. +- **Multiple Events Handling:** For months with multiple events or updates, events are listed as bullet points under the respective month for clarity. +- **Responsive Design:** The timeline is designed to be responsive and accessible on various devices, ensuring a seamless viewing experience. + +## How to Use + +1. **View the Timeline:** Visit the [Artificial Intelligence Timeline](https://nhlocal.github.io/AiTimeline/) to explore the events and developments in the AI industry. +2. **Navigate the Timeline:** Scroll through the timeline to view events chronologically, or use the year indicators to jump to specific years. +3. **Explore Events:** Click on individual events to view detailed descriptions and learn more about each milestone. + +## Contributing + +Contributions are welcome! If you have suggestions, corrections, or additional events to include in the timeline, please feel free to open an issue or submit a pull request. + +## About + +This timeline was created as a project to document and showcase the advancements in artificial intelligence technologies. It serves as a resource for researchers, enthusiasts, and anyone interested in tracking the progress and evolution of AI technologies over the years. + +## License + +This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details. + +--- + +Thank you for visiting the Artificial Intelligence Timeline! We hope you find this resource informative and useful. For any questions or inquiries, please [contact us](mailto:nh.local11@gmail.com). diff --git a/styles.css b/_site/assets/css/styles.css similarity index 100% rename from styles.css rename to _site/assets/css/styles.css diff --git a/favicon.png b/_site/assets/favicon.png similarity index 100% rename from favicon.png rename to _site/assets/favicon.png diff --git a/script.js b/_site/assets/js/script.js similarity index 100% rename from script.js rename to _site/assets/js/script.js diff --git a/_site/assets/preview.png b/_site/assets/preview.png new file mode 100644 index 0000000..fea9965 Binary files /dev/null and b/_site/assets/preview.png differ diff --git a/_site/index.html b/_site/index.html new file mode 100644 index 0000000..6fdfcf9 --- /dev/null +++ b/_site/index.html @@ -0,0 +1,455 @@ + + + + + + + + + + AI Timeline + + + + + +
+
+ + + +
+
+ +
+

Artificial Intelligence Timeline

+

2022 - Present

+
+ + + +
+ + +
+

2022

+ + +
+

February

+ +

Midjourney v1

+ +
+ +
+

April

+ +

Midjourney v2

+ +

DALL-E 2 is announced for gradual release.

+ +
+ +
+

July

+ +

Midjourney v3 is launched.

+ +
+ +
+

August

+ +

Stable Diffusion 1.4 is released.

+ +
+ +
+

October

+ +

Stable Diffusion 1.5 becomes available.

+ +
+ +
+

November

+ +

Midjourney v4 is released.

+ +

Stable Diffusion 2.0 is launched.

+ +

ChatGPT 3.5, a large language model by OpenAI, is released to the public and quickly becomes a viral sensation.

+ +
+ +
+

December

+ +

Stable Diffusion 2.1 is released.

+ +
+ +
+ +
+

2023

+ + +
+

February

+ +

Meta releases the LLaMA language model as open-source for research purposes. The model is later leaked.

+ +

Microsoft gradually releases Bing AI, an AI chat based on an upgraded GPT model integrating internet search.

+ +
+ +
+

March

+ +

Midjourney v5 is launched.

+ +

OpenAI's GPT-4 model is partially released, featuring multimodal image analysis and improved multi-language support.

+ +

Google releases the AI chat Bard in a limited capacity, based on the LaMDA language model.

+ +
+ +
+

April

+ +

Adobe releases the Firefly image creation model as a beta version to a waiting list. The model allowed a variety of capabilities including text formatting.

+ +
+ +
+

May

+ +

Midjourney v5.1 is released.

+ +

Google announces an upgrade to Bard, moving it to the upgraded PaLM 2 language model. It will support 180 countries and many languages.

+ +
+ +
+

June

+ +

Midjourney v5.2 is launched.

+ +
+ +
+

July

+ +

Stable Diffusion XL 1.0 is released.

+ +

Anthropic announces a new version of their large language model - Claude 2.

+ +

Meta releases the LLaMA 2 open source language model to the general public in a variety of sizes.

+ +
+ +
+

October

+ +

DALL-E 3 is released.

+ +

Adobe releases Firefly 2.

+ +
+ +
+

November

+ +

Stable Diffusion XL Turbo is released - A fast model that allows the creation of an image in one step in real time.

+ +
+ +
+

December

+ +

Midjourney v6 is launched.

+ +

Google upgrades Bard in limited areas, moving it to be based on the upgraded Gemini Pro language model.

+ +

X Corporation launches Grok AI chatbot for paid subscribers in English language.

+ +
+ +
+ +
+

2024

+ + +
+

February

+ +

Stability AI announces Stable Diffusion 3 (gradually released to waiting list).

+ +

Google upgrades the artificial intelligence chat in Bard, basing it on the new Gemini Pro model, in all available languages. Google replaces "Bard" with "Gemini".

+ +

Google announces the Gemini Pro 1.5 multimodal language model capable of parsing up to a million tokens, as well as parsing video and images. The model is gradually released to developers on a waiting list.

+ +

OpenAI announces the Sora model that produces videos up to a minute long. The model is not released to the public at this time.

+ +
+ +
+

March

+ +

X Corporation announces the upcoming release of the Grok 1.5 open source model.

+ +

Anthropic announces Claude 3, a new version of their large language model. The version is deployed in 3 different sizes, with the largest model performing better than GPT-4.

+ +

Suno AI, which develops a model for creating music, releases Suno v3 to the general public.

+ +
+ +
+

April

+ +

Stability AI releases a new update to the music creation model - Stable Audio 2.0.

+ +

X Corporation releases an upgrade to its language model, Grok-1.5V, which integrates high-level image recognition. In the test presented by the company, the model is the best in identifying and analyzing images compared to other models.

+ +

The Mistral company releases its new model Mixtral 8x22B as open source. This is the most powerful model among the open source models and it contains 141 billion parameters but uses a method that allows more economical use.

+ +

Meta releases the LLaMA 3 model as open source in sizes 8B and 70B parameters. The large model shows better performance than Claude 3 Sonnet and Gemini Pro 1.5 in several measures. Meta is expected to later release larger models with 400 billion parameters and more.

+ +

Microsoft releases the Phi-3-mini model in open source. The model comes in a reduced version of 3.8B parameters, which allows it to run on mobile devices as well, and it presents capabilities similar to GPT-3.5.

+ +

Adobe announces its new image creation model Firefly 3.

+ +

The startup Reka AI presents a series of multimodal language models in 3 sizes. The models are capable of processing video, audio and images. The large model featured similar capabilities to GPT-4.

+ +

Apple releases as full open source a series of small language models under the name OpenELM. The models are available in four weights between 270 million and 3 billion parameters.

+ +
+ +
+

May

+ +

OpenAI announces the GPT-4-O model that presents full multimodal capabilities, including receiving and creating text, images, and audio. The model presents an impressive ability to speak with a high response speed and in natural language. The model is 2 times more efficient than the GPT-4 Turbo model, and has better capabilities for languages other than English.

+ +

Google announces a large number of AI features in its products. The main ones: increasing the token limit to 2 million for Gemini 1.5 to waiting list, releasing a smaller and faster Gemini Flash 1.5 model. Revealing the latest image creation model Imagen 3, music creation model Music AI and video creation model Veo. And the announcement of the Astra model with multimodal capabilities for realtime audio and video reception.

+ +

Microsoft announces Copilot+ for dedicated computers, which will allow a full search of the user's history through screenshots of the user's activity. The company also released as open source the SLMs that display impressive capabilities in a minimal size: Phi-3 Small, Phi-3 Medium, and Phi-3 Vision which includes image recognition capability.

+ +

Meta introduces Chameleon, a new multimodal model that seamlessly renders text and images.

+ +

Mistral AI releases a new open source version of its language model Mistral-7B-Instruct-v0.3.

+ +

Google announces AI Overviews intended to give a summary of the relevant information in Google search.

+ +

Suno AI releases an updated music creation model Suno v3.5.

+ +

Mistral AI releases a new language model designed for coding Codestral in size 22B.

+ +
+ +
+

June

+ +

Stability AI releases its updated image creation model Stable Diffusion 3 in a medium version in size 2B parameters.

+ +

Apple announces Apple Intelligence, an AI system that will be integrated into the company's devices and will combine AI models of different sizes for different tasks.

+ +

DeepSeekAI publishes the DeepSeekCoderV2 open source language model which presents similar coding capabilities to models such as GPT-4, Claude 3 Opus and more.

+ +

Runway introduces Gen3 Alpha, a new AI model for video generation.

+ +

Anthropic releases the Claude Sonnet 3.5 model, which presents better capabilities than other models with low resource usage.

+ +

Microsoft releases in open source a series of image recognition models called Florence 2.

+ +

Google announces Gemma 2 open source language models with 9B and 27B parameter sizes. Also, the company opens the context window capabilities to developers for up to 2 million tokens.

+ +
+ +
+

July

+ +

OpenAI has released a miniaturized model called gpt4o mini that presents high capabilities at a low cost

+ +

Meta releases as open source the llama 3.1 model in sizes 8B 70B and 405B. The large model features the same capabilities as the best closed source models

+ +

mistral ai releases three new models: Codestral Mamba, Mistral NeMo and Mathstral designed for mathematics

+ +

Google DeepMind has unveiled two new AI systems that won silver medals at this year's International Mathematical Olympiad (IMO), AlphaProof and AlphaGeometry 2.

+ +

OpenAI launched SearchGPT, an integrated web search

+ +

Startup Udio has released Udio v1.5, an updated version of its music creation model

+ +

mistral ai has released a large language model Mistral Large 2 in size 123B, which presents capabilities close to the closed SOTA models.

+ +

Midjourney v6.1 is released

+ +

Google releases the Gemma 2 2B model as open source. The model demonstrates better capabilities than much larger models.

+ +
+ +
+

August

+ +

"Black Forest Labs" releases weights for an image creation model named Flux, which shows better performance than similar closedsource models.

+ +

OpenAI released a new version of its model, gpt4o 0806, achieving 100% success in generating valid JSON output.

+ +

Google's image generation model, Imagen 3, has been released.

+ +

xAI Corporation has launched the models Grok 2 and Grok 2 mini, which demonstrate performance on par with leading SOTA models in the market.

+ +

Microsoft has introduced its small language models, Phi 3.5, in three versions, each showcasing impressive performance relative to their size.

+ +

Google has introduced three new experimental AI models: Gemini 1.5 Flash8B, Gemini 1.5 Pro Enhanced, and Gemini 1.5 Flash Updated.

+ +

Ideogram 2.0 has been released, offering image generation capabilities that surpass those of other leading models.

+ +

Luma has unveiled the Dream Machine 1.5 model for video creation.

+ +
+ +
+

September

+ +

The French AI company Mistral has introduced Pixtral12B, its first multimodal model capable of processing both images and text.

+ +

OPENAI has released two nextgeneration AI models to its subscribers: o1 preview and o1 mini. These models show a significant improvement in performance, particularly in tasks requiring reasoning, including coding, mathematics, GPQA, and more.

+ +

Chinese company Alibaba releases the Qwen 2.5 model in various sizes, ranging from 0.5B to 72B. The models demonstrate capabilities comparable to much larger models.

+ +

The video generation model KLING 1.5 has been released.

+ +

OpenAI launches the advanced voice mode of GPT4o for all subscribers.

+ +

Meta releases Llama 3.2 in sizes 1B, 3B, 11B, and 90B, featuring image recognition capabilities for the first time.

+ +

Google has rolled out new model updates ready for deployment, Gemini Pro 1.5 002 and Gemini Flash 1.5 002, showcasing significantly improved longcontext processing.

+ +

Kyutai releases two opensource versions of its voicetovoice model, Moshi.

+ +

Google releases an update to its AI tool NotebookLM that enables users to create podcasts based on their own content.

+ +

Mistral AI launches a 22B model named Mistral Small.

+ +
+ +
+

October

+ +

Flux 1.1 Pro is released, showcasing advanced capabilities for image creation.

+ +

Meta unveils Movie Gen, a new AI model that generates videos, images, and audio from text input.

+ +

Pika introduces Video Model 1.5 along with "Pika Effects."

+ +

Adobe announces its video creation model, Firefly Video.

+ +

Startup Rhymes AI releases Aria, an opensource, multimodal model exhibiting capabilities similar to comparably sized proprietary models.

+ +

Meta releases an opensource speechtospeech language model named Meta Spirit LM.

+ +

Mistral AI introduces Ministral, a new model available in 3B and 8B parameter sizes.

+ +

Janus AI, a multimodal language model capable of recognizing and generating both text and images, is released as open source by DeepSeekAI.

+ +

Google DeepMind and MIT unveil Fluid, a texttoimage generation model with industryleading performance at a scale of 10.5B parameters.

+ +

Stable Diffusion 3.5 is released in three sizes as open source.

+ +

Anthropic launches Claude 3.5 Sonnet New, demonstrating significant advancements in specific areas over its previous version, and announces Claude 3.5 Haiku.

+ +

Anthropic announces an experimental feature for computer use with a public beta API.

+ +

The texttoimage model Recraft v3 has been released to the public, ranking first in benchmarks compared to similar models.

+ +

OpenAI has launched Search GPT, allowing users to perform web searches directly within the platform.

+ +
+ +
+

November

+ +

Alibaba released its new model, QwQ 32B Preview, which integrates reasoning capabilities before responding. The model competes with, and sometimes surpasses, OpenAI's o1-preview model.

+ +

Alibaba opensourced the model Qwen2.5 Coder 32B, which offers comparable capabilities to leading proprietary language models in the coding domain.

+ +

DeepSeek unveiled its new AI model, DeepSeek-R1-Lite-Preview, which incorporates reasoning capabilities and delivers impressive performance on the AIME and MATH benchmarks, matching the level of OpenAI's o1-preview.

+ +

Suno upgraded its AIpowered music generator to v4, introducing new features and performance improvements.

+ +

Mistral AI launched the Pixtral Large model, a multimodal language model excelling in image recognition and advanced performance metrics.

+ +

Google introduced two experimental models, gemini-exp-1114 and gemini-exp-1121, currently leading the chatbot landscape with enhanced performance.

+ +
+ +
+ + +
+ + + + + + + + + + + \ No newline at end of file diff --git a/summary.html b/_site/legacy/summary.html similarity index 100% rename from summary.html rename to _site/legacy/summary.html diff --git a/assets/css/styles.css b/assets/css/styles.css new file mode 100644 index 0000000..9d83907 --- /dev/null +++ b/assets/css/styles.css @@ -0,0 +1,530 @@ +/* Base Styles */ +body { + font-family: 'Roboto', sans-serif; + margin: 0; + padding: 0; + line-height: 1.5; + color: #333; + background-color: #f9f9f9; +} + +/* Header Styles */ +.header { + text-align: center; + padding: 20px 0; + background-color: #007BFF; + color: #fff; + position: relative; +} + +.header h1 { + font-size: 42px; + margin-bottom: 10px; +} + +.header h2 { + font-size: 28px; + font-weight: normal; +} + +/* GitHub Button */ +.github-button-container { + position: absolute; + top: 20px; + left: 20px; +} + +.github-button svg { + width: 30px; + height: 30px; + fill: #fff; + transition: transform 0.3s; +} + +.github-button:hover svg { + transform: scale(1.1); +} + +/* Dark Mode Toggle */ +.dark-mode-toggle-container { + position: absolute; + top: 20px; + right: 20px; +} + +.dark-mode-toggle { + background-color: transparent; + border: none; + font-size: 24px; + color: #fff; + cursor: pointer; + transition: transform 0.3s; +} + +.dark-mode-toggle:hover { + transform: scale(1.1); +} + +/* Navigation Bar (Index Page) */ +.year-nav { + text-align: center; + margin-bottom: 30px; + position: sticky; + top: 0; + background-color: #f9f9f9; + z-index: 100; + padding: 10px 0; +} + +.year-nav a { + display: inline-block; + margin: 0 10px; + padding: 8px 15px; + background-color: #f9f9f9; + border: 2px solid #007BFF; + border-radius: 20px; + text-decoration: none; + color: #333; + font-weight: bold; + transition: background-color 0.3s ease, color 0.3s ease; +} + +.year-nav a:hover { + background-color: #007BFF; + color: #fff; +} + +.year-nav a.active { + background-color: #007BFF; + color: #fff; +} + +/* Timeline Styles (Index Page) */ +.timeline { + max-width: 1000px; + margin: 0 auto; + position: relative; + padding: 0 20px; +} + +.year { + margin-bottom: 60px; +} + +.year h2 { + font-size: 32px; + color: #007BFF; + border-bottom: 2px solid #007BFF; + padding-bottom: 10px; + margin-bottom: 30px; +} + +.event { + border-left: 2px solid #007BFF; + padding-left: 30px; + margin-bottom: 40px; + position: relative; + padding-bottom: 20px; +} + +.event .date { + font-size: 22px; + font-weight: bold; + color: #555; + margin-bottom: 10px; +} + +.event .info { + margin-bottom: 10px; + position: relative; + padding-left: 20px; +} + +.event::before { + content: ''; + position: absolute; + top: 0; + left: -9px; + width: 18px; + height: 18px; + border-radius: 50%; + background-color: #007BFF; +} + +.info::before { + content: ''; + position: absolute; + top: 50%; + left: 0; + transform: translateY(-50%); + width: 8px; + height: 8px; + border-radius: 50%; + background-color: #007BFF; +} + +.info.special::before { + background-color: #b80003; + width: 9.5px; + height: 9.5px; +} + +/* Hover Effects */ +.event:hover { + background-color: #f4f4f4; +} + +/* Footer Styles */ +.footer { + text-align: center; + background-color: #f9f9f9; + padding: 20px 0; + border-top: 2px solid #007BFF; +} + +.footer .content { + max-width: 1000px; + margin: 0 auto; +} + +.footer h3 { + font-size: 24px; + color: #333; + margin-bottom: 15px; + margin-top: 10px; + text-align: left; + padding-left: 20px; +} + +.footer ul { + list-style: none; + padding: 0; + margin: 0; + padding-left: 30px; +} + +.footer ul li { + font-size: 18px; + color: #555; + margin-bottom: 10px; + padding-left: 20px; + text-align: left; + position: relative; +} + +.footer ul li a { + color: #007BFF; + text-decoration: none; + transition: color 0.3s; +} + +.footer ul li a:hover { + color: #0056b3; + text-decoration: underline; +} + +.footer ul li::before { + content: ''; + position: absolute; + top: 50%; + left: 0; + transform: translateY(-50%); + width: 8px; + height: 8px; + border-radius: 50%; + background-color: #007BFF; +} + +/* Topic Navigation Bar (Summary Page) */ +.topic-nav { + display: flex; + flex-wrap: wrap; + justify-content: center; + background-color: #0056b3; + padding: 10px 0; + position: sticky; + top: 0; + z-index: 1000; +} + +.topic-nav a { + color: #fff; + text-decoration: none; + margin: 5px 15px; + font-weight: bold; + transition: color 0.3s; +} + +.topic-nav a:hover { + color: #ffd700; +} + +.topic-nav a.active { + border-bottom: 2px solid #ffd700; + padding-bottom: 5px; +} + +/* Summary Container (Summary Page) */ +.summary-container { + max-width: 1200px; + margin: 40px auto; + padding: 0 20px; +} + +/* Topic Section (Summary Page) */ +.topic-section { + margin-bottom: 60px; +} + +.topic-section h2 { + font-size: 32px; + color: #007BFF; + border-bottom: 2px solid #007BFF; + padding-bottom: 10px; + margin-bottom: 30px; +} + +/* Card (Summary Page) */ +.card { + background-color: #fff; + border: 1px solid #ddd; + padding: 20px; + border-radius: 8px; + margin-bottom: 20px; +} + +.card h3 { + font-size: 24px; + color: #333; + margin-bottom: 15px; +} + +.card ul { + list-style: none; + padding: 0; + color: #555; +} + +.card ul li { + margin-bottom: 10px; + padding-left: 25px; + position: relative; +} + +.card ul li::before { + content: ''; + position: absolute; + left: 10px; + top: 12px; + width: 6px; + height: 6px; + background-color: #007BFF; + border-radius: 50%; +} + +.card ul li strong { + color: #333; +} + +/* Footer Styles (Shared) */ +footer { + background-color: #007BFF; + color: #fff; + padding: 40px 0; +} + +footer .content { + max-width: 1200px; + margin: 0 auto; + display: flex; + flex-wrap: wrap; + justify-content: space-between; + padding: 0 20px; +} + +.footer-section { + flex: 1 1 45%; + margin-bottom: 20px; +} + +.footer-section h3 { + margin-bottom: 15px; +} + +.footer-section p, +.footer-section ul { + margin-bottom: 15px; +} + +.footer-section ul { + list-style: none; + padding: 0; +} + +.footer-section ul li { + margin-bottom: 10px; +} + +.footer-section a { + color: #ffd700; + text-decoration: none; + transition: color 0.3s; +} + +.footer-section a:hover { + color: #fff; +} + +.github-link { + display: inline-flex; + align-items: center; +} + +.github-link svg { + margin-right: 8px; + fill: #ffd700; +} + +/* Dark Mode Styles */ +body.dark-mode { + background-color: #121212; + color: #e0e0e0; +} + +body.dark-mode .header { + background-color: #1f1f1f; + color: #e0e0e0; +} + +body.dark-mode .year-nav, +body.dark-mode .topic-nav { + background-color: #1f1f1f; +} + +body.dark-mode .year-nav a, +body.dark-mode .topic-nav a { + color: #e0e0e0; + background-color: transparent; + border-color: #555; +} + +body.dark-mode .year-nav a:hover, +body.dark-mode .topic-nav a:hover { + color: #ffd700; + background-color: #333; +} + +body.dark-mode .year-nav a.active, +body.dark-mode .topic-nav a.active { + background-color: #333; + color: #ffd700; + border-color: #ffd700; +} + +body.dark-mode .event { + background-color: #1c1c1c; + border-color: #333; +} + +body.dark-mode .event .date { + color: #ffd700; +} + +body.dark-mode .event .info { + color: #e0e0e0; +} + +body.dark-mode .card { + background-color: #1c1c1c; + border-color: #333; +} + +body.dark-mode .card h3 { + color: #ffd700; +} + +body.dark-mode .card ul li { + color: #e0e0e0; +} + +body.dark-mode .card ul li strong { + color: #ffffff; +} + +body.dark-mode .footer, +body.dark-mode footer { + background-color: #1f1f1f; + color: #e0e0e0; +} + +body.dark-mode .footer-section a, +body.dark-mode .github-link svg { + color: #ffd700; + fill: #ffd700; +} + +body.dark-mode .dark-mode-toggle { + color: #ffd700; +} + +/* Responsive Design */ +@media (max-width: 768px) { + .header h1 { + font-size: 32px; + } + + .header h2 { + font-size: 24px; + } + + .github-button-container, + .dark-mode-toggle-container { + top: 10px; + } + + .year-nav a, + .topic-nav a { + margin: 5px 10px; + } + + .footer .content { + flex-direction: column; + align-items: center; + } + + .footer-section { + flex: 1 1 100%; + text-align: center; + } + + .timeline, + .summary-container { + padding: 0 15px; + } +} + +/* Scrollbar Styles */ +::-webkit-scrollbar { + width: 12px; + background-color: #e4ecfd; +} + +::-webkit-scrollbar-thumb { + background-color: #003b7287; +} + +::-webkit-scrollbar-thumb:hover { + background-color: #003b72f7; +} + +body.dark-mode ::-webkit-scrollbar { + background-color: #1f1f1f; +} + +body.dark-mode ::-webkit-scrollbar-thumb { + background-color: #444; +} + +body.dark-mode ::-webkit-scrollbar-thumb:hover { + background-color: #555; +} diff --git a/assets/favicon.png b/assets/favicon.png new file mode 100644 index 0000000..3b5ff88 Binary files /dev/null and b/assets/favicon.png differ diff --git a/assets/js/script.js b/assets/js/script.js new file mode 100644 index 0000000..28ae552 --- /dev/null +++ b/assets/js/script.js @@ -0,0 +1,61 @@ +// Dark Mode Toggle +const darkModeToggle = document.getElementById('dark-mode-toggle'); + +if (darkModeToggle) { + darkModeToggle.addEventListener('click', function () { + document.body.classList.toggle('dark-mode'); + if (document.body.classList.contains('dark-mode')) { + this.textContent = '☀️'; + } else { + this.textContent = '🌙'; + } + }); +} + +// Navigation Active State for Index Page (Year Navigation) +const yearNavLinks = document.querySelectorAll('.year-nav a'); +const yearSections = document.querySelectorAll('.year'); + +if (yearNavLinks.length > 0 && yearSections.length > 0) { + window.addEventListener('scroll', () => { + let currentYear = ''; + + yearSections.forEach(section => { + const sectionTop = section.offsetTop - 200; + if (window.pageYOffset >= sectionTop) { + currentYear = section.getAttribute('id'); + } + }); + + yearNavLinks.forEach(link => { + link.classList.remove('active'); + if (link.getAttribute('href') === '#' + currentYear) { + link.classList.add('active'); + } + }); + }); +} + +// Navigation Active State for Summary Page (Topic Navigation) +const topicNavLinks = document.querySelectorAll('.topic-nav a'); +const topicSections = document.querySelectorAll('.topic-section'); + +if (topicNavLinks.length > 0 && topicSections.length > 0) { + window.addEventListener('scroll', () => { + let currentTopic = ''; + + topicSections.forEach(section => { + const sectionTop = section.offsetTop - 200; + if (window.pageYOffset >= sectionTop) { + currentTopic = section.getAttribute('id'); + } + }); + + topicNavLinks.forEach(link => { + link.classList.remove('active'); + if (link.getAttribute('href') === '#' + currentTopic) { + link.classList.add('active'); + } + }); + }); +} diff --git a/create-info/convert_to_html.py b/create-info/convert_to_html.py deleted file mode 100644 index d443311..0000000 --- a/create-info/convert_to_html.py +++ /dev/null @@ -1,43 +0,0 @@ -def convert_to_html(input_file, output_file): - with open(input_file, 'r') as f: - lines = f.readlines() - - html = [] - current_article = None - - for line in lines: - line = line.strip() - if line.startswith('--'): - # Close the previous article if exists - if current_article: - html.append('') - - month = line.replace('--', '').strip() - year = "2024" # You may want to make this dynamic - current_article = f'
' - html.append(current_article) - html.append(f'\t

{month}

') - elif line.startswith('-'): - event = line.replace('-', '').strip() - # Highlight all text between ** - while '**' in event: - event = event.replace('**', '', 1) - event = event.replace('**', '', 1) - - # Check if the event is special (you may want to define criteria for this) - if "special" in event.lower(): - html.append(f'\t

{event}

') - else: - html.append(f'\t

{event}

') - - # Close the last article if exists - if current_article: - html.append('
') - - with open(output_file, 'w') as f: - f.write('\n'.join(html)) - -# Usage -input_file = 'input.txt' -output_file = 'output.html' -convert_to_html(input_file, output_file) \ No newline at end of file diff --git a/create-info/input.txt b/create-info/input.txt deleted file mode 100644 index 1e6c2f2..0000000 --- a/create-info/input.txt +++ /dev/null @@ -1,7 +0,0 @@ --- November -- **Alibaba** released its new model, **QwQ-32B-Preview**, which integrates reasoning capabilities before responding. The model competes with, and sometimes surpasses, OpenAI's **o1-preview** model. -- **Alibaba** open-sourced the model **Qwen2.5 Coder 32B**, which offers comparable capabilities to leading proprietary language models in the coding domain. -- **DeepSeek** unveiled its new AI model, **DeepSeek-R1-Lite-Preview**, which incorporates reasoning capabilities and delivers impressive performance on the **AIME** and **MATH** benchmarks, matching the level of OpenAI's **o1-preview**. -- **Suno** upgraded its AI-powered music generator to **v4**, introducing new features and performance improvements. -- **Mistral AI** launched the **Pixtral Large** model, a multimodal language model excelling in image recognition and advanced performance metrics. -- **Google** introduced two experimental models, **gemini-exp-1114** and **gemini-exp-1121**, currently leading the chatbot landscape with enhanced performance. \ No newline at end of file diff --git a/create-info/output.html b/create-info/output.html deleted file mode 100644 index af7466c..0000000 --- a/create-info/output.html +++ /dev/null @@ -1,9 +0,0 @@ -
-

November

-

Alibaba released its new model, QwQ32BPreview, which integrates reasoning capabilities before responding. The model competes with, and sometimes surpasses, OpenAI's o1preview model.

-

Alibaba opensourced the model Qwen2.5 Coder 32B, which offers comparable capabilities to leading proprietary language models in the coding domain.

-

DeepSeek unveiled its new AI model, DeepSeekR1LitePreview, which incorporates reasoning capabilities and delivers impressive performance on the AIME and MATH benchmarks, matching the level of OpenAI's o1preview.

-

Suno upgraded its AIpowered music generator to v4, introducing new features and performance improvements.

-

Mistral AI launched the Pixtral Large model, a multimodal language model excelling in image recognition and advanced performance metrics.

-

Google introduced two experimental models, geminiexp1114 and geminiexp1121, currently leading the chatbot landscape with enhanced performance.

-
\ No newline at end of file diff --git a/create-info/readme.txt b/create-info/readme.txt deleted file mode 100644 index db433e3..0000000 --- a/create-info/readme.txt +++ /dev/null @@ -1,17 +0,0 @@ --- How to Fill Out This File -- - -Use this file to enter events organized by month. Each month should start with a line containing two dashes (--), followed by the month and year (e.g., "-- February 2024"). - -Each event within a month should start with a line containing one dash (-), followed by the event description. - -To highlight text within an event, wrap the text you want to highlight with double asterisks (**). This will be converted to bold text in the HTML output. - -Example: - --- February 2024 -- **Stability AI** announces Stable Diffusion 3 -- Google announces the **Gemini Pro 1.5** multimodal language model - --- March 2024 -- X Corporation announces the Grok 1.5 open source model -- Anthropic announces version 3 of the Claude language model \ No newline at end of file diff --git a/index.html b/index.html deleted file mode 100644 index 5d097ff..0000000 --- a/index.html +++ /dev/null @@ -1,306 +0,0 @@ - - - - - - - - - - AI Timeline - - - - - -
-
- - - -
-
- -
-

Artificial Intelligence Timeline

-

2022 - Present

-
- - - -
- -
-

2022

- -
-

February

-

Midjourney v1

-
- -
-

April

-

Midjourney v2

-

DALL-E 2 is announced for gradual release.

-
- -
-

July

-

Midjourney v3 is launched.

-
- -
-

August

-

Stable Diffusion 1.4 is released.

-
- -
-

October

-

Stable Diffusion 1.5 becomes available.

-
- -
-

November

-

Midjourney v4 is released.

-

Stable Diffusion 2.0 is launched.

-

ChatGPT 3.5, a large language model by OpenAI, is released to the public and quickly becomes a viral sensation.

-
- -
-

December

-

Stable Diffusion 2.1 is released.

-
-
- - -
-

2023

- -
-

February

-

Meta releases the LLaMA language model as open-source for research purposes. The model is later leaked.

-

Microsoft gradually releases Bing AI, an AI chat based on an upgraded GPT model integrating internet search.

-
- -
-

March

-

Midjourney v5 is launched.

-

OpenAI's GPT-4 model is partially released, featuring multimodal image analysis and improved multi-language support.

-

Google releases the AI chat Bard in a limited capacity, based on the LaMDA language model.

-
- -
-

April

-

Adobe releases the Firefly image creation model as a beta version to a waiting list. The model allowed a variety of capabilities including text formatting.

-
- -
-

May

-

Midjourney v5.1 is released.

-

Google announces an upgrade to Bard, moving it to the upgraded PaLM 2 language model. It will support 180 countries and many languages.

-
- -
-

June

-

Midjourney v5.2 is launched.

-
- -
-

July

-

Stable Diffusion XL 1.0 is released.

-

Anthropic announces a new version of their large language model - Claude 2.

-

Meta releases the LLaMA 2 open source language model to the general public in a variety of sizes.

-
- -
-

October

-

DALL-E 3 is released.

-

Adobe releases Firefly 2.

-
- -
-

November

-

Stable Diffusion XL Turbo is released - A fast model that allows the creation of an image in one step in real time.

-
- -
-

December

-

Midjourney v6 is launched.

-

Google upgrades Bard in limited areas, moving it to be based on the upgraded Gemini Pro language model.

-

X Corporation launches Grok AI chatbot for paid subscribers in English language.

-
-
- - -
-

2024

- -
-

February

-

Stability AI announces Stable Diffusion 3 (gradually released to waiting list).

-

Google upgrades the artificial intelligence chat in Bard, basing it on the new Gemini Pro model, in all available languages. Google replaces "Bard" with "Gemini".

-

Google announces the Gemini Pro 1.5 multimodal language model capable of parsing up to a million tokens, as well as parsing video and images. The model is gradually released to developers on a waiting list.

-

OpenAI announces the Sora model that produces videos up to a minute long. The model is not released to the public at this time.

-
- -
-

March

-

X Corporation announces the upcoming release of the Grok 1.5 open source model.

-

Anthropic announces Claude 3, a new version of their large language model. The version is deployed in 3 different sizes, with the largest model performing better than GPT-4.

-

Suno AI, which develops a model for creating music, releases Suno v3 to the general public.

-
- -
-

April

-

Stability AI releases a new update to the music creation model - Stable Audio 2.0.

-

X Corporation releases an upgrade to its language model, Grok-1.5V, which integrates high-level image recognition. In the test presented by the company, the model is the best in identifying and analyzing images compared to other models.

-

The Mistral company releases its new model Mixtral 8x22B as open source. This is the most powerful model among the open source models and it contains 141 billion parameters but uses a method that allows more economical use.

-

Meta releases the LLaMA 3 model as open source in sizes 8B and 70B parameters. The large model shows better performance than Claude 3 Sonnet and Gemini Pro 1.5 in several measures. Meta is expected to later release larger models with 400 billion parameters and more.

-

Microsoft releases the Phi-3-mini model in open source. The model comes in a reduced version of 3.8B parameters, which allows it to run on mobile devices as well, and it presents capabilities similar to GPT-3.5.

-

Adobe announces its new image creation model Firefly 3.

-

The startup Reka AI presents a series of multimodal language models in 3 sizes. The models are capable of processing video, audio and images. The large model featured similar capabilities to GPT-4.

-

Apple releases as full open source a series of small language models under the name OpenELM. The models are available in four weights between 270 million and 3 billion parameters.

-
- -
-

May

-

OpenAI announces the GPT-4-O model that presents full multimodal capabilities, including receiving and creating text, images, and audio. The model presents an impressive ability to speak with a high response speed and in natural language. The model is 2 times more efficient than the GPT-4 Turbo model, and has better capabilities for languages other than English.

-

Google announces a large number of AI features in its products. The main ones: increasing the token limit to 2 million for Gemini 1.5 to waiting list, releasing a smaller and faster Gemini Flash 1.5 model. Revealing the latest image creation model Imagen 3, music creation model Music AI and video creation model Veo. And the announcement of the Astra model with multimodal capabilities for realtime audio and video reception.

-

Microsoft announces Copilot+ for dedicated computers, which will allow a full search of the user's history through screenshots of the user's activity. The company also released as open source the SLMs that display impressive capabilities in a minimal size: Phi-3 Small, Phi-3 Medium, and Phi-3 Vision which includes image recognition capability.

-

Meta introduces Chameleon, a new multimodal model that seamlessly renders text and images.

-

Mistral AI releases a new open source version of its language model Mistral-7B-Instruct-v0.3.

-

Google announces AI Overviews intended to give a summary of the relevant information in Google search.

-

Suno AI releases an updated music creation model Suno v3.5.

-

Mistral AI releases a new language model designed for coding Codestral in size 22B.

-
- -
-

June

-

Stability AI releases its updated image creation model Stable Diffusion 3 in a medium version in size 2B parameters.

-

Apple announces Apple Intelligence, an AI system that will be integrated into the company's devices and will combine AI models of different sizes for different tasks.

-

DeepSeekAI publishes the DeepSeekCoderV2 open source language model which presents similar coding capabilities to models such as GPT-4, Claude 3 Opus and more.

-

Runway introduces Gen3 Alpha, a new AI model for video generation.

-

Anthropic releases the Claude Sonnet 3.5 model, which presents better capabilities than other models with low resource usage.

-

Microsoft releases in open source a series of image recognition models called Florence 2.

-

Google announces Gemma 2 open source language models with 9B and 27B parameter sizes. Also, the company opens the context window capabilities to developers for up to 2 million tokens.

-
- -
-

July

-

OpenAI has released a miniaturized model called gpt4o mini that presents high capabilities at a low cost

-

Meta releases as open source the llama 3.1 model in sizes 8B 70B and 405B. The large model features the same capabilities as the best closed source models

-

mistral ai releases three new models: Codestral Mamba, Mistral NeMo and Mathstral designed for mathematics

-

Google DeepMind has unveiled two new AI systems that won silver medals at this year's International Mathematical Olympiad (IMO), AlphaProof and AlphaGeometry 2.

-

OpenAI launched SearchGPT, an integrated web search

-

Startup Udio has released Udio v1.5, an updated version of its music creation model

-

mistral ai has released a large language model Mistral Large 2 in size 123B, which presents capabilities close to the closed SOTA models.

-

Midjourney v6.1 is released

-

Google releases the Gemma 2 2B model as open source. The model demonstrates better capabilities than much larger models.

-
- -
-

August

-

"Black Forest Labs" releases weights for an image creation model named Flux, which shows better performance than similar closedsource models.

-

OpenAI released a new version of its model, gpt4o 0806, achieving 100% success in generating valid JSON output.

-

Google's image generation model, Imagen 3, has been released.

-

xAI Corporation has launched the models Grok 2 and Grok 2 mini, which demonstrate performance on par with leading SOTA models in the market.

-

Microsoft has introduced its small language models, Phi 3.5, in three versions, each showcasing impressive performance relative to their size.

-

Google has introduced three new experimental AI models: Gemini 1.5 Flash8B, Gemini 1.5 Pro Enhanced, and Gemini 1.5 Flash Updated.

-

Ideogram 2.0 has been released, offering image generation capabilities that surpass those of other leading models.

-

Luma has unveiled the Dream Machine 1.5 model for video creation.

-
- -
-

September

-

The French AI company Mistral has introduced Pixtral12B, its first multimodal model capable of processing both images and text.

-

OPENAI has released two nextgeneration AI models to its subscribers: o1 preview and o1 mini. These models show a significant improvement in performance, particularly in tasks requiring reasoning, including coding, mathematics, GPQA, and more.

-

Chinese company Alibaba releases the Qwen 2.5 model in various sizes, ranging from 0.5B to 72B. The models demonstrate capabilities comparable to much larger models.

-

The video generation model KLING 1.5 has been released.

-

OpenAI launches the advanced voice mode of GPT4o for all subscribers.

-

Meta releases Llama 3.2 in sizes 1B, 3B, 11B, and 90B, featuring image recognition capabilities for the first time.

-

Google has rolled out new model updates ready for deployment, Gemini Pro 1.5 002 and Gemini Flash 1.5 002, showcasing significantly improved longcontext processing.

-

Kyutai releases two opensource versions of its voicetovoice model, Moshi.

-

Google releases an update to its AI tool NotebookLM that enables users to create podcasts based on their own content.

-

Mistral AI launches a 22B model named Mistral Small.

-
- -
-

October

-

Flux 1.1 Pro is released, showcasing advanced capabilities for image creation.

-

Meta unveils Movie Gen, a new AI model that generates videos, images, and audio from text input.

-

Pika introduces Video Model 1.5 along with "Pika Effects."

-

Adobe announces its video creation model, Firefly Video.

-

Startup Rhymes AI releases Aria, an opensource, multimodal model exhibiting capabilities similar to comparably sized proprietary models.

-

Meta releases an opensource speechtospeech language model named Meta Spirit LM.

-

Mistral AI introduces Ministral, a new model available in 3B and 8B parameter sizes.

-

Janus AI, a multimodal language model capable of recognizing and generating both text and images, is released as open source by DeepSeekAI.

-

Google DeepMind and MIT unveil Fluid, a texttoimage generation model with industryleading performance at a scale of 10.5B parameters.

-

Stable Diffusion 3.5 is released in three sizes as open source.

-

Anthropic launches Claude 3.5 Sonnet New, demonstrating significant advancements in specific areas over its previous version, and announces Claude 3.5 Haiku.

-

Anthropic announces an experimental feature for computer use with a public beta API.

-

The texttoimage model Recraft v3 has been released to the public, ranking first in benchmarks compared to similar models.

-

OpenAI has launched Search GPT, allowing users to perform web searches directly within the platform.

-
- -
-

November

-

Alibaba released its new model, QwQ 32B Preview, which integrates reasoning capabilities before responding. The model competes with, and sometimes surpasses, OpenAI's o1-preview model.

-

Alibaba opensourced the model Qwen2.5 Coder 32B, which offers comparable capabilities to leading proprietary language models in the coding domain.

-

DeepSeek unveiled its new AI model, DeepSeek-R1-Lite-Preview, which incorporates reasoning capabilities and delivers impressive performance on the AIME and MATH benchmarks, matching the level of OpenAI's o1-preview.

-

Suno upgraded its AIpowered music generator to v4, introducing new features and performance improvements.

-

Mistral AI launched the Pixtral Large model, a multimodal language model excelling in image recognition and advanced performance metrics.

-

Google introduced two experimental models, gemini-exp-1114 and gemini-exp-1121, currently leading the chatbot landscape with enhanced performance.

-
- -
- -
- - - - - - - - \ No newline at end of file diff --git a/index.md b/index.md new file mode 100644 index 0000000..9ff5b63 --- /dev/null +++ b/index.md @@ -0,0 +1,62 @@ +--- +layout: default +title: AI Timeline +description: A comprehensive timeline of Artificial Intelligence milestones from 2022 to present. +--- + +
+
+ + + +
+
+ +
+

Artificial Intelligence Timeline

+

2022 - Present

+
+ + + +
+ + {% for year in site.data.timeline %} +
+

{{ year.year }}

+ + {% for event in year.events %} +
+

{{ event.date | date: "%B" }}

+ {% for info in event.info %} +

{{ info.text }}

+ {% endfor %} +
+ {% endfor %} +
+ {% endfor %} + +
+ + + + + + \ No newline at end of file diff --git a/legacy/summary.html b/legacy/summary.html new file mode 100644 index 0000000..928c145 --- /dev/null +++ b/legacy/summary.html @@ -0,0 +1,341 @@ + + + + + + + AI Developments Summary + + + + + + + + + + + + +
+
+ + + + +
+
+ +
+

AI Developments Summary

+

2022 - 2024

+
+ + + + + +
+ +
+

Language Models

+ + +
+

2022

+
    +
  • November: ChatGPT 3.5 by OpenAI was released to the public and quickly became a viral sensation.
  • +
+
+ + +
+

2023

+
    +
  • February: LLaMA by Meta was released as an open-source language model for research purposes and was later leaked.
  • +
  • March: GPT-4 by OpenAI was partially released, featuring multimodal image analysis and improved multi-language support.
  • +
  • July: Claude 2 by Anthropic was announced as a new version of their large language model.
  • +
+
+ + +
+

2024

+
    +
  • March: Claude 3 by Anthropic was announced, featuring improved capabilities.
  • +
  • May: GPT-4-O by OpenAI was announced with full multimodal capabilities.
  • +
  • April: LLaMA 3 by Meta was released as an open-source model in sizes 8B and 70B parameters.
  • +
  • July: LLaMA 3.1 by Meta was released in sizes 8B, 70B, and 405B, featuring capabilities comparable to the best closed-source models.
  • +
  • March: Grok 1.5 by X Corporation was announced as an upcoming open-source model.
  • +
  • April: Grok-1.5V was released, integrating high-level image recognition.
  • +
  • April: Mixtral 8x22B by Mistral AI was released as an open-source model.
  • +
  • May: Phi-3-mini by Microsoft was released as an open-source model.
  • +
  • April: OpenELM by Apple was released as a series of small language models.
  • +
  • July: gpt4o mini by OpenAI was released, offering high capabilities at a low cost.
  • +
  • August: gpt4o 0806 by OpenAI was released, achieving 100% success in generating valid JSON output.
  • +
  • August: LTM2mini by Magic AI was developed, capable of working with a context window of 100 million tokens.
  • +
  • September: o1 preview and o1 mini by OpenAI were released, showing significant improvements in reasoning tasks.
  • +
  • July: Mistral Large 2 by Mistral AI was released, presenting capabilities close to closed-source models.
  • +
  • July: AlphaProof and AlphaGeometry 2 by Google DeepMind were unveiled, winning silver medals at the International Mathematical Olympiad.
  • +
+
+
+ + +
+

Image Models

+ + +
+

2022

+
    +
  • February: Midjourney v1 was released.
  • +
  • April: Midjourney v2 was released.
  • +
  • April: DALL-E 2 was announced for gradual release.
  • +
  • July: Midjourney v3 was launched.
  • +
  • August: Stable Diffusion 1.4 was released.
  • +
  • October: Stable Diffusion 1.5 became available.
  • +
  • November: Midjourney v4 was released.
  • +
  • November: Stable Diffusion 2.0 was launched.
  • +
  • December: Stable Diffusion 2.1 was released.
  • +
+
+ + +
+

2023

+
    +
  • March: Midjourney v5 was launched.
  • +
  • May: Midjourney v5.1 was released.
  • +
  • June: Midjourney v5.2 was launched.
  • +
  • July: Stable Diffusion XL 1.0 was released.
  • +
  • October: DALL-E 3 was released.
  • +
  • October: Adobe Firefly 2 was released.
  • +
  • November: Stable Diffusion XL Turbo was released, allowing real-time image creation.
  • +
+
+ + +
+

2024

+
    +
  • February: Stable Diffusion 3 was announced and gradually released.
  • +
  • December 2023: Midjourney v6 was launched.
  • +
  • July: Midjourney v6.1 was released.
  • +
  • May: Firefly 3 by Adobe was announced.
  • +
  • August: Flux by Black Forest Labs was released, outperforming similar models.
  • +
  • August: Imagen 3 by Google was released.
  • +
  • August: Ideogram 2.0 was released, offering superior image generation capabilities.
  • +
  • April: Chameleon by Meta was introduced, seamlessly rendering text and images.
  • +
+
+
+ + +
+

Video Models

+ + +
+

2024

+
    +
  • February: Sora by OpenAI was announced, capable of producing videos up to a minute long.
  • +
  • May: Veo by Google was announced.
  • +
  • June: Gen3 Alpha by Runway was introduced.
  • +
  • August: Dream Machine 1.5 by Luma was unveiled.
  • +
+
+
+ + +
+

Music Models

+ + +
+

2023

+
    +
  • March: Suno v3 by Suno AI was released to the public.
  • +
+
+ + +
+

2024

+
    +
  • April: Stable Audio 2.0 by Stability AI was released.
  • +
  • May: Music AI by Google was announced.
  • +
  • May: Suno v3.5 was released.
  • +
  • July: Udio v1.5 was released, an updated music creation model.
  • +
+
+
+ + +
+

Multimodal Models

+ + +
+

2023

+
    +
  • March: GPT-4 by OpenAI featured multimodal image analysis.
  • +
  • March: Bard by Google was released with limited capabilities.
  • +
+
+ + +
+

2024

+
    +
  • February: Gemini Pro 1.5 by Google was announced, capable of parsing up to a million tokens, images, and videos.
  • +
  • April: Chameleon by Meta was introduced, seamlessly rendering text and images.
  • +
  • May: Astra by Google was announced, a multimodal model for real-time audio and video reception.
  • +
  • June: DeepSeekCoderV2 by DeepSeekAI was published, an open-source model with impressive capabilities.
  • +
  • August: Pixtral12B by Mistral was introduced, capable of processing both images and text.
  • +
  • May: Reka AI presented a series of multimodal language models capable of processing video, audio, and images.
  • +
+
+
+ + +
+

Chatbots

+ + +
+

2022

+
    +
  • November: ChatGPT 3.5 by OpenAI was released.
  • +
+
+ + +
+

2023

+
    +
  • February: Bing AI by Microsoft was gradually released, integrating internet search.
  • +
  • March: Bard by Google was released in a limited capacity.
  • +
+
+ + +
+

2024

+
    +
  • February: Grok AI by X Corporation was launched.
  • +
  • May: SearchGPT by OpenAI was launched, an integrated web search.
  • +
  • May: AI Overviews by Google was announced for Google Search.
  • +
  • May: Copilot+ by Microsoft was announced, allowing full search of user history through screenshots.
  • +
+
+
+ + +
+

Open-Source Models

+ + +
+

2022

+
    +
  • August: Stable Diffusion 1.4 was released.
  • +
  • October: Stable Diffusion 1.5 was released.
  • +
+
+ + +
+

2023

+
    +
  • February: LLaMA by Meta was released and later leaked.
  • +
+
+ + +
+

2024

+
    +
  • April: LLaMA 3 by Meta was released.
  • +
  • July: LLaMA 3.1 by Meta was released.
  • +
  • July: Mistral Large 2 by Mistral AI was released.
  • +
  • April: Mixtral 8x22B by Mistral AI was released.
  • +
  • June: Gemma 2 by Google was released.
  • +
  • May: Phi-3-mini by Microsoft was released.
  • +
  • April: OpenELM by Apple was released.
  • +
  • June: DeepSeekCoderV2 by DeepSeekAI was published.
  • +
  • August: Pixtral12B by Mistral was introduced.
  • +
  • May: Codestral by Mistral AI was released.
  • +
  • July: Codestral Mamba, Mistral NeMo, and Mathstral were released by Mistral AI.
  • +
  • May: Florence 2 image recognition models by Microsoft were released.
  • +
+
+
+
+ + + + + + + +