What's new in Generative AI land
Regularly updated sources
- https://www.reddit.com/r/LocalLLaMA/
- https://github.com/nichtdax/awesome-totally-open-chatgpt
- https://github.com/zhengzangw/awesome-huge-models
- https://huggingface.co/blog
- https://eng.lyft.com
Big tech news
Adobe
-
2023-03-21🧨 https://firefly.adobe.com/ -
2023-05-23🧨 https://www.adobe.com/products/photoshop/generative-fill.html -
2023-10-10🧨 Firefly 2
AI21 labs
- Jurassic models
-
2024-03-28https://www.ai21.com/jamba
Amazon
-
2012-12-03https://aws.amazon.com/about-aws/whats-new/2019/12/introducing-amazon-sagemaker-autopilot/ -
2021-11-30https://aws.amazon.com/blogs/aws/announcing-amazon-sagemaker-canvas-a-visual-no-code-machine-learning-capability-for-business-analysts/ -
2023-04-13https://aws.amazon.com/blogs/machine-learning/announcing-new-tools-for-building-with-generative-ai-on-aws- Amazon Bedrock
- 🖥️ Amazon CodeWhisperer
-
2023-04-17https://aws.amazon.com/blogs/machine-learning/deploy-large-models-at-high-performance-using-fastertransformer-on-amazon-sagemaker/ -
2023-05-03https://aws.amazon.com/blogs/machine-learning/quickly-build-high-accuracy-generative-ai-applications-on-enterprise-data-using-amazon-kendra-langchain-and-large-language-models/ -
2023-05-10https://aws.amazon.com/blogs/machine-learning/announcing-new-jupyter-contributions-by-aws-to-democratize-generative-ai-and-scale-ml-workloads/ -
2023-06-29https://aws.amazon.com/blogs/machine-learning/interactively-fine-tune-falcon-40b-and-other-llms-on-amazon-sagemaker-studio-notebooks-using-qlora/ -
2023-07-05https://github.com/aws-samples/amazon-bedrock-samples -
2023-07-18https://aws.amazon.com/blogs/machine-learning/llama-2-foundation-models-from-meta-are-now-available-in-amazon-sagemaker-jumpstart/ -
2023-08-16https://aws.amazon.com/blogs/machine-learning/how-thomson-reuters-developed-open-arena-an-enterprise-grade-large-language-model-playground-in-under-6-weeks -
2023-09-28https://www.aboutamazon.com/news/aws/aws-amazon-bedrock-general-availability-generative-ai-innovations -
2023-12-01AWS re:Invent- https://aws.amazon.com/q
- https://press.aboutamazon.com/2023/11/aws-unveils-next-generation-aws-designed-chips
- https://aws.amazon.com/sagemaker/clarify/
- https://aws.amazon.com/sagemaker/canvas/
- https://aws.amazon.com/blogs/aws/introducing-amazon-sagemaker-hyperpod-a-purpose-built-infrastructure-for-distributed-training-at-scale/
-
2024-07-10https://aws.amazon.com/appstudio/
Andrej Karpathy / Eureka Labs
-
2015-05-21https://karpathy.github.io/2015/05/21/rnn-effectiveness/ -
2022-08-16https://karpathy.ai/zero-to-hero.html -
2023-04-09BabyGPT -
2023-05-23State of GPT -
2023-07-23https://github.com/karpathy/llama2.c -
2023-11-231h intro to LLMs -
2024-02-20Let's build the GPT Tokenizer -
2024-02-20https://github.com/karpathy/minbpe - https://github.com/karpathy/llm.c
-
2024-07-11train GPT-2 for $672 -
2024-07-16Announcing Eureka Labs
Anthropic
-
2023-03-14https://www.anthropic.com/index/introducing-claude -
2023-05-11https://www.anthropic.com/index/100k-context-windows -
2023-07-11https://www.anthropic.com/index/claude-2 -
2023-09-25https://www.anthropic.com/index/anthropic-amazon -
2023-10-27https://www.wsj.com/tech/ai/google-commits-2-billion-in-funding-to-ai-startup-anthropic-db4d4c50 -
2023-11-21https://www.anthropic.com/index/claude-2-1 -
2024-03-04https://www.anthropic.com/news/claude-3-family- Haiku
- Sonnet
- Opus
-
2024-03-13https://www.anthropic.com/news/claude-3-haiku -
2024-03-27https://www.aboutamazon.com/news/company-news/amazon-anthropic-ai-investment - https://docs.anthropic.com/en/prompt-library/library
- https://github.com/anthropics/anthropic-cookbook
-
2024-06-21https://www.anthropic.com/news/claude-3-5-sonnet -
2024-07-16https://www.anthropic.com/news/android-app
Apple
- https://developer.apple.com/metal
- https://github.com/ml-explore/mlx
- https://github.com/ml-explore/mlx-examples
- https://github.com/apple/ml-ferret
-
2024-03-14https://machinelearning.apple.com/research/mm1-methods-analysis-insights -
2024-04-22https://machinelearning.apple.com/research/openelm
Black Forest Labs
-
2024-08--1https://blackforestlabs.ai/announcing-black-forest-labs/- Stability.ai spin-off
Chip Huyen
- https://huyenchip.com/ml-interviews-book/
-
2022-01-03https://stanford-cs329s.github.io -
2022-05-01https://www.oreilly.com/library/view/designing-machine-learning/9781098107956/ -
2023-04-11https://huyenchip.com/2023/04/11/llm-engineering.html -
2023-05-02https://huyenchip.com/2023/05/02/rlhf.html -
2023-06-07https://huyenchip.com/2023/06/07/generative-ai-strategy.html -
2023-08-16https://huyenchip.com/2023/08/16/llm-research-open-challenges.html -
2024-01-16https://huyenchip.com/2024/01/16/sampling.html -
2024-07-25https://huyenchip.com/2024/07/25/genai-platform.html - https://www.oreilly.com/library/view/ai-engineering/9781098166298/
Cohere
-
2024-02-13https://txt.cohere.com/aya/ -
2024-03-11https://cohere.com/blog/command-r -
2024-04-04https://cohere.com/blog/command-r-plus-microsoft-azure
Databricks
-
2023-06-09https://github.com/databrickslabs/pyspark-ai -
2023-06-26https://www.wsj.com/articles/databricks-strikes-1-3-billion-deal-for-generative-ai-startup-mosaicml-fdcefc06 -
2024-03-27https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
- https://www.tensorflow.org
-
2017-06-12Attention is all you need -
2022-12-13🧨 https://imagen.research.google/editor/ -
2023-05-01https://www.nytimes.com/2023/05/01/technology/ai-google-chatbot-engineer-quits-hinton.html -
2023-05-04✨ https://www.semianalysis.com/p/google-we-have-no-moat-and-neither -
2023-05-10https://blog.google/technology/ai/google-io-2023-keynote-sundar-pichai/#palm-2-gemini -
2023-05-10https://cloud.google.com/ai/generative-ai -
2023-05-11https://workspace.google.com/blog/product-announcements/duet-ai - https://developers.google.com/mediapipe/solutions
-
2023-06-29🧨 https://ai.googleblog.com/2023/06/on-device-diffusion-plugins-for.html -
2023-07-12https://blog.google/technology/ai/notebooklm-google-ai/ -
2023-08-08https://idx.dev -
2023-08-29🧨 https://www.deepmind.com/blog/identifying-ai-generated-images-with-synthid -
2023-08-29https://cloud.google.com/blog/products/ai-machine-learning/vertex-ai-colab-enterprise-and-mlops -
2023-10-12https://blog.google/products/search/google-search-generative-ai-october-update/ -
2023-11-06https://storage.googleapis.com/deepmind-media/AlphaCode2/AlphaCode2_Tech_Report.pdf -
2023-11-16https://deepmind.google/discover/blog/transforming-the-future-of-music-creation/- https://blog.youtube/inside-youtube/ai-and-music-experiment/
- DeepMind Lyria
- no paper or code yet
-
2023-12-06https://blog.google/technology/ai/google-gemini-ai/ -
2023-12-06https://blog.google/products/bard/google-bard-try-gemini-ai/ -
2023-12-08https://blog.google/technology/ai/notebooklm-new-features-availability/ -
2023-12-13https://blog.google/technology/ai/google-gemini-pro-imagen-duet-ai-update/ -
2023-12-13https://blog.google/technology/ai/gemini-api-developers-cloud/ -
2023-12-13https://cloud.google.com/blog/products/ai-machine-learning/gemini-support-on-vertex-ai -
2023-12-13https://cloud.google.com/blog/topics/healthcare-life-sciences/introducing-medlm-for-the-healthcare-industry -
2023-12-13https://cloud.google.com/blog/products/ai-machine-learning/duet-ai-for-developers-and-in-security-operations-now-ga -
2023-12-14🧨 https://cloud.google.com/blog/products/ai-machine-learning/imagen-2-on-vertex-ai-is-now-generally-available -
2024-01-03🧨 https://instruct-imagen.github.io/ -
2024-01-17https://deepmind.google/discover/blog/alphageometry-an-olympiad-level-ai-system-for-geometry/ -
2024-01-23📽️ https://lumiere-video.github.io/ -
2024-02-08https://blog.google/products/gemini/bard-gemini-advanced-app/- Bard renamed to Gemini
- launch of Gemini mobile apps
- launch of Gemini Advanced (cf. ChatGPT Plus?) with Ultra 1.0 model
-
2024-02-15https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024/ -
2024-02-21https://blog.google/technology/developers/gemma-open-models/ - https://aitestkitchen.withgoogle.com/tools/image-fx
- https://deepmind.google/discover/blog/sima-generalist-ai-agent-for-3d-virtual-environments/
-
2024-02-23https://sites.google.com/view/genie-2024 -
2024-05-08https://blog.google/technology/ai/google-deepmind-isomorphic-alphafold-3-ai-model/ -
2024-05-14https://blog.google/technology/ai/google-gemini-update-flash-ai-assistant-io-2024/ -
2024-05-14https://developers.googleblog.com/en/start-building-with-project-idx-today/ -
2024-06-17🎧 https://deepmind.google/discover/blog/generating-audio-for-video/ -
2024-07-25https://blog.google/products/gemini/google-gemini-new-features-july-2024/ -
2024-07-25https://deepmind.google/discover/blog/ai-solves-imo-problems-at-silver-medal-level/
Huggingface
-
2023-04-25https://huggingface.co/chat -
2023-05-09https://huggingface.co/blog/starchat-alpha -
2023-05-10https://huggingface.co/docs/transformers/transformers_agents -
2023-05-31https://huggingface.co/blog/sagemaker-huggingface-llm -
2024-02-02https://huggingface.co/chat/assistants
Inflection
- https://inflection.ai
- https://pi.ai
-
2023-06-22https://inflection.ai/inflection-1 -
2023-11-22https://inflection.ai/inflection-2 -
2024-03-07https://inflection.ai/inflection-2-5
Lightning AI
-
2023-04-26https://lightning.ai/pages/community/tutorial/lora-llm/ -
2023-04-28https://lightning.ai/pages/community/community-discussions/the-ultimate-battle-of-language-models-lit-llama-vs-gpt3.5-vs-bloom-vs/ -
2023-05-05https://github.com/Lightning-AI/lit-gpt -
2024-02-18https://lightning.ai/lightning-ai/studios/code-lora-from-scratch?view=public§ion=all - https://github.com/Lightning-AI/lightning-thunder
Meta
-
2016-07-15https://github.com/facebookresearch/fastText/ -
2016-09-xx✨ https://pytorch.org -
2017-02-28https://github.com/facebookresearch/faiss -
2017-04-18https://caffe2.ai/blog/2017/04/18/caffe2-open-source-announcement.html -
2017-09-07https://research.facebook.com/blog/2017/9/facebook-and-microsoft-introduce-new-open-ecosystem-for-interchangeable-ai-frameworks/- ONNX
-
2018-05-02https://caffe2.ai/blog/2018/05/02/Caffe2_PyTorch_1_0.html -
2021-07-15https://engineering.fb.com/2021/07/15/open-source/fsdp/ -
2021-xx-xxhttps://github.com/facebookresearch/fairscale -
2022-xx-xxhttps://github.com/facebookresearch/xformers -
2022-09-12https://pytorch.org/blog/PyTorchfoundation/ -
2022-05-03https://huggingface.co/facebook/opt-30b -
2022-05-03https://ai.meta.com/blog/democratizing-access-to-large-scale-language-models-with-opt-175b/ -
2022-11-16https://huggingface.co/facebook/galactica-120b -
2023-02-24✨ https://ai.meta.com/blog/large-language-model-llama-meta-ai/ -
2023-03-15https://pytorch.org/blog/pytorch-2.0-release/ -
2023-04-05https://ai.meta.com/blog/segment-anything-foundation-model-image-segmentation/ -
2023-04-25https://ai.facebook.com/blog/self-supervised-learning-practical-guide -
2023-04-17https://ai.meta.com/blog/dino-v2-computer-vision-self-supervised-learning/ -
2023-05-09https://ai.meta.com/blog/imagebind-six-modalities-binding-ai/ -
2023-07-18https://ai.meta.com/blog/llama-2/ -
2023-08-22https://ai.meta.com/blog/seamless-m4t/ -
2023-08-24https://ai.meta.com/blog/code-llama-large-language-model-coding/ -
2023-08-25https://facebookresearch.github.io/nougat/ -
2023-09-27https://about.fb.com/news/2023/09/introducing-ai-powered-assistants-characters-and-creative-tools/ -
2023-11-16🧨 https://emu-video.metademolab.com - 🧨 https://imagine.meta.com
-
2023-11-30https://ai.meta.com/blog/seamless-communication/ -
2024-01-29CodeLlama-70B -
2024-03-12https://engineering.fb.com/2024/03/12/data-center-engineering/building-metas-genai-infrastructure/ -
2024-03-19https://www.bloomberg.com/news/articles/2024-03-19/microsoft-hires-deepmind-co-founder-suleyman-to-run-consumer-ai -
2024-03-21https://www.bloomberg.com/news/articles/2024-03-21/microsoft-to-pay-inflection-ai-650-million-after-scooping-up-most-of-staff -
2024-04-18https://ai.meta.com/blog/meta-llama-3/ -
2024-06-12https://engineering.fb.com/2024/06/12/data-infrastructure/training-large-language-models-at-scale-meta/ -
2024-07-02https://ai.meta.com/research/publications/meta-3d-gen/ -
2024-07-23https://ai.meta.com/blog/meta-llama-3-1/ -
2024-07-29https://ai.meta.com/research/publications/sam-2-segment-anything-in-images-and-videos/
Microsoft
-
2021-06-17https://github.com/microsoft/LoRA -
2022-02-23🖥️ https://github.blog/news-insights/product-news/introducing-github-copilot-ai-pair-programmer/ -
2023-02-07New Bing -
2023-03-16https://www.microsoft.com/en-us/microsoft-365/blog/2023/03/16/introducing-microsoft-365-copilot-a-whole-new-way-to-work/ -
2023-03-22🖥️ https://github.blog/2023-03-22-github-copilot-x-the-ai-powered-developer-experience/ -
2023-03-28https://blogs.microsoft.com/blog/2023/03/28/introducing-microsoft-security-copilot-empowering-defenders-at-the-speed-of-ai/ -
2023-03-21Bing Image Creator -
2023-07-18https://www.microsoft.com/en-us/microsoft-365/blog/2023/07/18/introducing-bing-chat-enterprise-microsoft-365-copilot-pricing-and-microsoft-sales-copilot/ -
2023-07-20https://github.com/microsoft/promptflow -
2023-09-07https://blogs.microsoft.com/on-the-issues/2023/09/07/copilot-copyright-commitment-ai-legal-concerns/ -
2023-09-20🖥️ https://github.blog/2023-09-20-github-copilot-chat-beta-now-available-for-all-individuals/ -
2023-12-12https://www.microsoft.com/en-us/research/blog/phi-2-the-surprising-power-of-small-language-models/ -
2023-12-29🖥️ https://github.blog/2023-12-29-github-copilot-chat-now-generally-available-for-organizations-and-individuals/ -
2024-01-15https://blogs.microsoft.com/blog/2024/01/15/bringing-the-full-power-of-copilot-to-more-people-and-businesses/
Midjourney
-
2023-12-21🧨 https://mid-journey.ai/midjourney-v6-release/
Mistral
-
2023-09-27https://mistral.ai/news/announcing-mistral-7b/ -
2023-12-11https://mistral.ai/news/mixtral-of-experts/ -
2023-12-11https://mistral.ai/news/la-plateforme/ -
2024-02-26https://mistral.ai/news/mistral-large/ -
2024-04-17https://mistral.ai/news/mixtral-8x22b/ -
2024-05-29https://mistral.ai/news/codestral/ -
2024-07-16https://mistral.ai/news/codestral-mamba/ -
2024-07-16https://mistral.ai/news/mathstral/ -
2024-07-18https://mistral.ai/news/mistral-nemo/ -
2024-07-24https://mistral.ai/news/mistral-large-2407/
NVIDIA
- https://github.com/NVIDIA/TensorRT-LLM
-
2024-02-13https://blogs.nvidia.com/blog/chat-with-rtx-available-now/ - NVIDIA NIM
Ollama
-
2023-07-08https://github.com/jmorganca/ollama -
2024-02-08https://ollama.com/blog/openai-compatibility -
2024-02-15https://ollama.com/blog/windows-preview -
2024-03-14https://ollama.com/blog/amd-preview
OpenAI
- models
- GPT
- GPT 2
- GPT 3
- GPT 3.5
- GPT 3.5 Turbo
- GPT 4
- GPT 4 Turbo
- GPT 4o
- GPT 4o mini
- o1
- o1 mini
- GPT 4.5
- o3 mini
- GPT 4.1
- GPT 4.1 mini
- GPT 4.1 nano
- o3
- o4 mini
-
2021-07-27https://openai.com/research/triton -
2023-01-31https://openai.com/blog/new-ai-classifier-for-indicating-ai-written-text-
2023-07-20offline due to low accuracy
-
-
2023-03-02ChatGPT political bias -
2023-03-14https://openai.com/index/gpt-4-research/ -
2023-04-17https://www.wired.com/story/openai-ceo-sam-altman-the-age-of-giant-ai-models-is-already-over/ -
2023-04-25https://openai.com/blog/new-ways-to-manage-your-data-in-chatgpt -
2023-05-09https://openai.com/research/language-models-can-explain-neurons-in-language-models -
2023-05-18https://apps.apple.com/app/openai-chatgpt/id6448311069 -
2023-05-31https://openai.com/research/improving-mathematical-reasoning-with-process-supervision -
2023-06-11https://openai.com/blog/function-calling-and-other-api-updates -
2023-07-06https://openai.com/blog/gpt-4-api-general-availability -
2023-07-06ChatGPT Code interpreter -
2023-07-11https://investor.shutterstock.com/news-releases/news-release-details/shutterstock-expands-partnership-openai-signs-new-six-year -
2023-07-18How is ChatGPT's behavior changing over time? -
2023-07-20https://openai.com/blog/custom-instructions-for-chatgpt -
2023-07-25https://play.google.com/store/apps/details?id=com.openai.chatgpt -
2023-07-26https://openai.com/blog/frontier-model-forum- Anthropic, Google, Microsoft, and OpenAI
-
2023-08-28https://openai.com/blog/introducing-chatgpt-enterprise -
2023-08-31https://openai.com/blog/teaching-with-ai -
2023-09-25https://openai.com/blog/chatgpt-can-now-see-hear-and-speak-
https://cdn.openai.com/papers/GPTV_System_Card.pdf
- TODO add to models
-
https://cdn.openai.com/papers/GPTV_System_Card.pdf
-
2023-09-27ChatGPT can now browse the internet -
2023-10-03https://cdn.openai.com/papers/DALL_E_3_System_Card.pdf- https://cdn.openai.com/papers/dall-e-3.pdf
- TODO add to models
-
2023-10-11ChatGPT system messages -
2023-10-19https://openai.com/blog/dall-e-3-is-now-available-in-chatgpt-plus-and-enterprise -
2023-10-29ChatGPT PDF support preview -
2023-11-06https://openai.com/blog/new-models-and-developer-products-announced-at-devday- GPT-4 Turbo
- GPT-4 Turbo with Vision
- GPT-4 fine tuning
- Assistants API
- DALL-E 3 API
- Whisper v3
- Copyright Shield
-
2023-11-06https://openai.com/blog/introducing-gpts -
2023-11-17https://openai.com/blog/openai-announces-leadership-transition -
2023-11-21ChatGPT with voice public availability -
2023-11-29https://openai.com/blog/sam-altman-returns-as-ceo-openai-has-a-new-initial-board - https://platform.openai.com/docs/guides/prompt-engineering/strategy-use-external-tools
-
2024-01-10https://openai.com/blog/introducing-chatgpt-team -
2024-01-25https://openai.com/blog/new-embedding-models-and-api-updates -
2024-02-13https://openai.com/blog/memory-and-new-controls-for-chatgpt -
2024-02-15📽️ https://openai.com/research/video-generation-models-as-world-simulators -
2024-03-12https://github.com/openai/transformer-debugger -
2024-03-29https://openai.com/index/navigating-the-challenges-and-opportunities-of-synthetic-voices/ -
2024-04-01https://openai.com/index/start-using-chatgpt-instantly/ -
2024-04-03🧨 edit DALL·E images in ChatGPT -
2024-04-04https://openai.com/index/introducing-improvements-to-the-fine-tuning-api-and-expanding-our-custom-models-program/ -
2024-05-08https://openai.com/index/introducing-the-model-spec/ -
2024-05-13https://openai.com/index/hello-gpt-4o/ -
2024-07-18https://openai.com/index/gpt-4o-mini-advancing-cost-efficient-intelligence/ -
2024-07-25https://openai.com/index/searchgpt-prototype/ -
2024-09-12https://openai.com/index/openai-o1-mini-advancing-cost-efficient-reasoning/ -
2024-09-12https://openai.com/index/learning-to-reason-with-llms/ -
2024-12-05https://openai.com/index/introducing-chatgpt-pro/ -
2024-12-05https://openai.com/index/openai-o1-system-card/ -
2024-12-18https://help.openai.com/en/articles/10193193-1-800-chatgpt-calling-and-messaging-chatgpt-with-your-phone -
2025-01-31https://openai.com/index/openai-o3-mini/ -
2025-02-02https://openai.com/index/introducing-deep-research/ -
2025-02-23https://openai.com/index/introducing-operator/ -
2025-02-27https://openai.com/index/introducing-gpt-4-5/ -
2025-02-27https://openai.com/index/gpt-4-5-system-card/ -
2025-03-25https://openai.com/index/introducing-4o-image-generation/ -
2025-04-14https://openai.com/index/gpt-4-1/ -
2025-04-16https://openai.com/index/introducing-o3-and-o4-mini/
Runway ML
Stability AI
- 🧨 https://clipdrop.co
-
2023-03-17https://stability.ai/blog/stable-diffusion-reimagine -
2023-05-25https://stability.ai/blog/stability-ai-clipdrop-launches-reimagine-xl -
2023-06-08https://stability.ai/blog/clipdrop-launches-uncrop-the-ultimate-aspect-ratio-editor -
2023-07-13https://stability.ai/blog/clipdrop-launches-stable-doodle
-
-
2023-05-11🧨 https://stability.ai/blog/stable-animation-sdk - 🧨 https://dreamstudio.ai
-
2023-05-17https://github.com/Stability-AI/StableStudio
-
-
2023-08-11https://research.stability.ai/chat -
2023-11-01https://stability.ai/news/stability-ai-enhanced-image-apis-for-business-features -
2023-11-21https://stability.ai/news/stable-video-diffusion-open-ai-video-model -
2023-11-28https://stability.ai/news/stability-ai-sdxl-turbo -
2023-12-07https://stability.ai/news/stablelm-zephyr-3b-stability-llm -
2024-01-16https://stability.ai/news/stable-code-2024-llm-code-completion-release -
2024-02-12https://stability.ai/news/introducing-stable-cascade -
2024-02-22🧨 https://stability.ai/news/stable-diffusion-3 -
2024-03-13https://stability.ai/news/celebrating-one-year-of-medarc -
2024-03-18https://stability.ai/news/introducing-stable-video-3d -
2024-03-23https://stability.ai/news/stabilityai-announcement -
2024-03-25https://stability.ai/news/introducing-stable-code-instruct-3b -
2024-04-03🎧 https://stability.ai/news/stable-audio-2-0 -
2024-04-08https://stability.ai/news/introducing-stable-lm-2-12b -
2024-04-17🧨 https://stability.ai/news/stable-diffusion-3-api -
2024-06-05🎧 https://stability.ai/news/introducing-stable-audio-open -
2024-06-12🧨 https://stability.ai/news/stable-diffusion-3-medium -
2024-08-01https://stability.ai/news/introducing-stable-fast-3d
xAI
-
2023-07-12xAI Grok -
2023-11-06https://x.ai/prompt-ide/
Other news
-
2019-10-18https://thegradient.pub/understanding-evaluation-metrics-for-language-models/ -
2020-07-28https://dugas.ch/artificial_curiosity/GPT_architecture.html -
2022-03-30https://kipp.ly/transformer-param-count -
2023-02-14https://writings.stephenwolfram.com/2023/02/what-is-chatgpt-doing-and-why-does-it-work/ -
2023-02-21https://www.pinecone.io/learn/langchain/ -
2023-03-02https://www.activeloop.ai/resources/ultimate-guide-to-lang-chain-deep-lake-build-chat-gpt-to-answer-questions-on-your-financial-data/ -
2023-03-04https://confusedbit.dev/posts/how_does_gpt_work/ -
2023-03-29https://github.com/xtekky/gpt4free -
2023-03-29https://ai.v-gar.de/ml/transformer/timeline/- TODO update csv
-
2023-03-30https://kipp.ly/transformer-taxonomy -
2023-04-05ChaosGPT -
2023-04-10https://www.izzy.co/blogs/robo-boys.html -
2023-04-15https://www.activeloop.ai/resources/lang-chain-gpt-4-for-code-understanding-twitter-algorithm/ -
2023-04-16🧨 https://www.shruggingface.com/blog/how-i-used-stable-diffusion-and-dreambooth-to-create-a-painted-portrait-of-my-dog -
2023-04-19https://www.similarweb.com/blog/insights/ai-news/stack-overflow-chatgpt/ -
2023-04-20https://github.com/Vision-CAIR/MiniGPT-4 -
2023-04-21https://github.com/brexhq/prompt-engineering -
2023-04-21https://gist.github.com/timesler/4b244a6b73d6e02d17fd220fd92dfaec -
2023-04-22https://magazine.sebastianraschka.com/p/finetuning-large-language-models -
2023-04-28https://github.com/jostmey/NakedAttention -
2023-04-29https://thegradient.pub/in-context-learning-in-context/ -
2023-04-30https://github.com/Mooler0410/LLMsPracticalGuide -
2023-04-30https://agi-sphere.com/llama-models/ -
2023-04-30https://www.wired.com/story/how-chatgpt-works-large-language-model/ -
2023-05-01https://www.kdnuggets.com/2023/05/machine-learning-chatgpt-cheat-sheet.html -
2023-05-02https://www.philschmid.de/sagemaker-fsdp-gpt -
2023-05-03https://www.assemblyai.com/blog/the-full-story-of-large-language-models-and-rlhf/ -
2023-05-05https://www.pinecone.io/learn/vector-database -
2023-05-09https://newsroom.ibm.com/2023-05-09-IBM-Unveils-the-Watsonx-Platform-to-Power-Next-Generation-Foundation-Models-for-Business -
2023-05-12https://www.kdnuggets.com/2023/05/8-free-ai-llms-playgrounds.html -
2023-05-15https://erichartford.com/uncensored-models -
2023-05-15https://blog.gopenai.com/how-to-speed-up-llms-and-use-100k-context-window-all-tricks-in-one-place-ffd40577b4c -
2023-05-17https://github.com/ray-project/llm-numbers -
2023-05-25✨ https://a16z.com/2023/05/25/ai-canon -
2023-06-05🧨 https://www.reddit.com/r/StableDiffusion/comments/141hg9x/controlnet_for_qr_code/ -
2023-06-12https://www.evidentlyai.com/ml-system-design -
2023-06-14https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/the-economic-potential-of-generative-ai-the-next-productivity-frontier -
2023-06-15https://crfm.stanford.edu/2023/06/15/eu-ai-act.html -
2023-06-20https://a16z.com/2023/06/20/emerging-architectures-for-llm-applications/ -
2023-06-27https://mlops.community/unraveling-gpu-inference-costs-for-fine-tuned-open-source-models-v-s-closed-platforms/ -
2023-06-30✨ https://neptune.ai/blog/mlops-tools-platforms-landscape -
2023-07-03https://www.similarweb.com/blog/insights/ai-news/chatgpt-traffic-drops/ -
2023-07-09https://blog.mithrilsecurity.io/poisongpt-how-we-hid-a-lobotomized-llm-on-hugging-face-to-spread-fake-news/ -
2023-07-10https://www.semianalysis.com/p/gpt-4-architecture-infrastructure -
2023-07-20https://www.cursor.so/blog/llama-inference -
2023-07-20✨ https://github.com/tikkuncreation/llama-2-resources -
2023-07-22https://replicate.com/blog/run-llama-locally -
2023-07-23https://willthompson.name/what-we-know-about-llms-primer -
2023-07-27https://stackoverflow.blog/2023/07/27/announcing-overflowai/- TODO
-
2023-07-27https://hacks.mozilla.org/2023/07/so-you-want-to-build-your-own-open-source-chatbot/ -
2023-07-27https://llm-attacks.org -
2023-07-30https://betterprogramming.pub/frameworks-for-serving-llms-60b7f7b23407 -
2023-07-30✨ https://eugeneyan.com/writing/llm-patterns/ -
2023-07-31https://arstechnica.com/science/2023/07/a-jargon-free-explanation-of-how-ai-large-language-models-work/ -
2023-08-08https://www.trustwave.com/en-us/resources/blogs/spiderlabs-blog/wormgpt-and-fraudgpt-the-rise-of-malicious-llms/ -
2023-08-09✨ https://blog.briankitano.com/llama-from-scratch/ -
2023-08-11https://www.anyscale.com/blog/fine-tuning-llama-2-a-comprehensive-case-study-for-tailoring-models-to-unique-applications -
2023-08-24https://scale.com/blog/open-ai-scale-partnership-gpt-3-5-fine-tuning -
2023-09-06raccoons.be - gen AI playbook -
2023-09-12Fine-tune your own Llama 2 to replace GPT-3.5/4 -
2023-09-13https://www.anyscale.com/blog/a-comprehensive-guide-for-building-rag-based-llm-applications-part-1 -
2023-09-14LLM agent survey -
2023-09-18https://www.anyscale.com/anyscale-launches-new-service-anyscale-endpoints-10x-more-cost-effective-for-most-popular-open-source-llms -
2023-10-09https://blog.replit.com/ai4all -
2023-10-18How ChatGPT Vision Works - YouTube -
2023-11-08Samsung Gauss -
2023-11-16https://pytorch.org/blog/accelerating-generative-ai -
2023-11-29https://blog.perplexity.ai/blog/introducing-pplx-online-llms -
2023-11-30https://pytorch.org/blog/accelerating-generative-ai-2 -
2023-12-09https://www.europarl.europa.eu/news/en/press-room/20231206IPR15699/artificial-intelligence-act-deal-on-comprehensive-rules-for-trustworthy-ai -
2024-01-03https://pytorch.org/blog/accelerating-generative-ai-3 -
2024-02-01https://shyam.blog/posts/beyond-self-attention/ -
2024-02-01https://allenai.org/olmo -
2024-02-19Transformers demystified - https://www.chenyang.co/diffusion.html
-
2024-05-29https://foojay.io/today/indexing-all-of-wikipedia-on-a-laptop/ -
2024-06-05https://opening-up-chatgpt.github.io/ -
2024-06-08https://applied-llms.org/ -
2024-06-13https://huggingface.co/blog/mlabonne/abliteration -
2024-06-19https://epochai.org/data/notable-ai-models - https://situational-awareness.ai/
-
2024-06-25https://imbue.com/research/70b-infrastructure/ - https://fastvoiceagent.cerebrium.ai/
-
2024-08-09LLM price-performance graph
Tools
- https://onnx.ai/
- 🖥️ https://www.tabnine.com
- https://www.baseten.co
- http://vectors.nlpl.eu/explore/embeddings/en/
- https://github.com/togethercomputer/OpenChatKit
- https://github.com/streamlit/streamlit
-
2020-07-09https://github.com/gradio-app/gradio -
2022-05-27https://github.com/Dao-AILab/flash-attention -
2022-09-02🧨 https://promptbase.com -
2021-12-21🧨 https://github.com/invoke-ai/InvokeAI -
2022-09-12🧨 https://github.com/brycedrennan/imaginAIry -
2022-10-24https://github.com/hwchase17/langchain -
2022-12-21🧨 https://github.com/oobabooga/text-generation-webui -
2023-01-17🧨 https://github.com/comfyanonymous/ComfyUI -
2023-01-24🧨 https://github.com/AUTOMATIC1111/stable-diffusion-webui -
2023-01-25🧨 https://www.shutterstock.com/ai-image-generator- uses DALL-E
-
2023-01-29https://github.com/arc53/DocsGPT -
2023-03-13https://github.com/ShreyaR/guardrails -
2023-03-16🖥️ https://github.com/TabbyML/tabby -
2023-03-19https://chat.lmsys.org -
2023-03-30https://github.com/Significant-Gravitas/Auto-GPT - 🧨 https://github.com/apple/ml-stable-diffusion
-
2023-03-10https://github.com/ggerganov/llama.cpp-
2023-05-14now supports partial GPU usage -
2023-06-12CUDA support
-
-
2023-03-14https://github.com/getumbrel/llama-gpt -
https://github.com/bigscience-workshop/petals
- distributed LLM finetuning and inference
- https://github.com/jerryjliu/llama_index
-
https://github.com/jupyterlab/jupyter-ai
- Codex clone using https://huggingface.co/Salesforce/codegen-350M-multi
- https://kubiya.ai
- https://www.chatpdf.com
- https://pdf.ai
-
2023-03-30https://www.cursor.so -
2023-04-18https://github.com/h2oai/h2o-llmstudio -
2023-04-19✨ https://github.com/smallcloudai/refact - https://github.com/paulpierre/RasaGPT
-
2023-04-29https://github.com/mlc-ai/mlc-llm -
2023-05-02https://www.modular.com/mojo -
2023-05-02https://heypi.com -
2023-05-02https://github.com/imartinez/privateGPT -
2023-05-06https://github.com/nadermx/backgroundremover -
2023-05-12https://github.com/assafelovic/gpt-researcher - https://github.com/LucienShui/huggingface-vscode-endpoint-server
- https://gandalf.lakera.ai
- https://faraday.dev
- https://sourcegraph.com
-
2023-05-13🧨 https://github.com/varunshenoy/opendream -
2023-05-13https://github.com/StanGirard/quivr -
2023-05-24https://github.com/artidoro/qlora - 🖥️ https://github.com/continuedev/continue
-
2023-06-08https://simonwillison.net/2023/Jun/8/gpt-tokenizers/ -
2023-06-11https://github.com/AntonOsika/gpt-engineer -
2023-06-20https://github.com/embedchain/embedchain -
2023-06-29https://github.com/ShishirPatil/gorilla -
2023-07-14https://github.com/KillianLucas/open-interpreter -
2023-07-18https://github.com/Pythagora-io/gpt-pilot -
2023-07-24https://github.com/kuafuai/DevOpsGPT -
2023-07-26https://github.com/Alpha-VLLM/LLaMA2-Accessory -
2023-08-10https://github.com/modelscope/facechain -
2023-08-27https://github.com/turboderp/exllamav2 -
2023-09-07https://labs.heygen.com/video-translate - https://www.trulens.org
- https://www.phind.com/search?home=true
-
2023-10-05GenAI Stack - Docker -
2023-10-20https://neuralmagic.com/blog/building-sparse-llm-applications-on-cpus-with-langchain-and-deepsparse/- https://github.com/neuralmagic/deepsparse
- people behind DeepSparse were also behind GPTQ
-
2023-11-01🧨 https://lumalabs.ai/genie -
2023-11-18https://github.com/tldraw/make-real - https://www.perplexity.ai
-
2023-11-28📽️ https://pika.art/launch -
2023-11-29https://hacks.mozilla.org/2023/11/introducing-llamafile/ - https://github.com/ishan0102/vimGPT
- https://github.com/BuilderIO/gpt-crawler
- https://github.com/Vaibhavs10/insanely-fast-whisper
- https://github.com/intel/intel-extension-for-transformers
- https://github.com/coqui-ai/TTS
- https://github.com/Niek/chatgpt-web
- https://lmstudio.ai
- https://www.suno.ai
- https://github.com/imoneoi/openchat
-
2023-12-03https://github.com/myshell-ai/OpenVoice -
2023-12-16https://github.com/SJTU-IPADS/PowerInfer -
2023-12-19🧨 https://github.com/cumulo-autumn/StreamDiffusion - https://github.com/dvmazur/mixtral-offloading
-
2024-01-15🧨 https://github.com/InstantID/InstantID - https://jan.ai/
- https://github.com/danswer-ai/danswer
- https://github.com/KillianLucas/open-interpreter
-
2024-02-19https://groq.com -
2024-03-12https://www.cognition-labs.com/introducing-devin -
2024-03-28https://github.com/jasonppy/VoiceCraft - 🧨 https://github.com/lllyasviel/Fooocus
- 📽️ https://blog.lumalabs.ai/p/dream-machine
-
2024-07-11https://tridao.me/blog/2024/flash3/ - https://github.com/saoudrizwan/claude-dev
- https://github.com/OpenInterpreter/open-interpreter
