Future Forward - Emerging Tech & AI Newsletter - 17th Edition
Welcome to the 17th Edition of Future Forward - the Emerging Tech & AI Newsletter!
This newsletter aims to help you stay up-to-date on the latest trends in emerging technologies. Subscribe to the newsletter today and never miss a beat!
Here's what you can expect in each new issue of the Emerging Tech & AI Newsletter:
Last Week in AI/Emerging Tech
The field of AI is experiencing rapid and continuous progress in various areas. Some of the notable advancements and trends from the last week include:
Big Tech in AI:
Funding & VC Landscape:
Other AI news:
Liked the news summary. Subscribe to the newsletter to keep getting updates every week. Check the comments section on the LinkedIn article for links to the Other AI news.
Large Vision Models
Just as Large Language Models (LLMs) have revolutionized how we interact with text, LVMs are poised to transform how we understand and interact with the visual world.
What are Large Vision Models?
LVMs are complex deep learning algorithms trained on massive datasets of images and videos. These datasets contain millions of images labelled with detailed information about their content. By analyzing these images, LVMs learn to recognize patterns and relationships, allowing them to perform a wide range of tasks, such as:
Current Applications of LVMs
LVMs are already being used in various applications across diverse industries:
Challenges and Future Directions
Despite their impressive capabilities, LVMs face several challenges:
LVM examples:
Here are some of the latest LVMs:
Vision Transformer (ViT): This LVM broke away from the traditional convolutional neural networks (CNNs) and uses a transformer architecture, achieving state-of-the-art performance in image classification and object detection tasks. ViT models like DeiT, Swin Transformer, and ViT-G/L continue to push the boundaries of performance and efficiency.
CLIP: This LVM excels at aligning image and text data, allowing for tasks like image captioning, visual question answering, and zero-shot learning. New CLIP-based models like UNIMO and ALIGN explore additional functionalities like image generation and attribute manipulation.
LaMDA-ViT: This LVM combines the strengths of Google's LaMDA language model with a ViT visual model, enabling it to understand and respond to complex image-based queries and generate creative text descriptions of images.
PaLM-ViT: Similar to LaMDA-ViT, this model combines PaLM (another powerful LLM) with a ViT model, leading to even more advanced capabilities in image understanding and generation, including image editing, style transfer, and video captioning.
VQ-VAE-2: This LVM utilizes a variational autoencoder architecture for image generation, allowing it to create high-quality and diverse images with greater control over details and style.
Diffusion Models: These models, like Dall-E 2 and Imagen, use a diffusion process to gradually generate images from noise, offering impressive photorealism and artistic potential.
BEiT: This model focuses on pre-training LVMs on massive datasets of unlabeled images, enabling them to learn generalizable representations useful for diverse downstream tasks without the need for extensive labelled data.
LVMs are also expanding beyond image analysis, tackling tasks like video understanding and multi-modal learning. Models like Video Swin Transformer and MEGATRON-Turing NLG are leading the way in these areas. CogVLM is a powerful open-source visual language model (VLM). CogVLM-17B has 10 billion vision parameters and 7 billion language parameters.
Curious to know more? Let us know what follow-up details you would like in the comments and we will plan a detailed article on Tecnologia.
Coding in the AI Era
We wrote a primer on AI-assisted Software Engineering in our fourth edition. Since then there has been tremendous inquiry around the code generation aspect. Tool-based support in writing code has a history spanning over 40 years, originating with features such as syntax highlighting, autocompletion in Integrated Development Environments (IDEs), and code analysis with Linting. In the more recent past, tools like DeepCode (now Synk) have utilized machine learning to provide more sophisticated and intelligent coding suggestions.
AI-assisted coding is not new but has received tremendous attention since the release of tools that can generate code based on text prompts. In Dec 2022, Google DeepMind 's AlphaCode wrote computer programs at a competitive level and has achieved a rank in the top 54%. AlphaCode uses transformer-based language models to generate code at an unprecedented scale, and then smartly filters to a small set of promising programs.
(A) In pretraining, files from GitHub are randomly split into two parts. The first part goes to the encoder as input, and the decoder is trained to produce the second part. (B) In fine-tuning, problem descriptions (formatted as comments) are given to the encoder, and the decoder is trained to generate the solutions. (C) For evaluation, AlphaCode generates many samples for each problem description, then it executes them to filter out bad samples and cluster the remaining ones before finally submitting a small set of candidates.
OpenAI 's ChatGPT(Codex model ) triggered rapid advancements in the area of CodeLLMs. In August this year, Meta released Code Llama which scored 53.7% on HumanEval and 56.2% on MBPP. Github's Copliot has become the world's most adopted AI developer tool. Google offers similar services through Vertex AI and Amazon Web Services (AWS) has CodeWhisperer.
Here are some of the most common use cases these AI-based coding tools/CoPilot's serve:
Many enterprises have also started using open-source models/ paid models to create bespoke solutions that fit their needs of code generation and other use cases as mentioned above. There are several wrappers available to build something similar to GitHub CoPilot - Faux Pilot and continue.
We covered LLMs from big tech AI companies above. Here's a list of additional LLMs :
Here's how different code LLMs compare
Source - Awesome LLM on github
Challenges: LLMs, or Language Models, are known to occasionally produce hallucinations, and this phenomenon is not exclusive to code-generating LLMs. Similar to other instances of LLM hallucinations, the generated code may appear well-structured and function adequately, but it might not perform the intended actions, leading to the creation of subtle bugs that can be challenging to identify.
Coding in the AI era: The Code LLM technology is improving day by day. In the AI era, development teams must adapt to this new way of AI-assisted coding. They should learn how to best work with code-generating LLMs: which LLMs to use, what tooling is available, what prompts should be used to get desired results and how to ensure that there are no errors in the AI-generated code.
Disclosure: Content in Large Vision Models in the article was written with the help of Google Bard.
Thanks for reading. See you next week!
Let's explore the future of technology together!
2x CXO | 4x Exit | AI-led Transformation | Investor & Advisor | Building Intelligent, Scalable Organizations | MobileAppDaily Tech 40 under 40 List 2023 | AI Strategy & Leadership | GrowthX Fellow | Your goto AI guy
1yLinks for Other AI News: 1. Magic Animate - https://huggingface.co/spaces/zcxu-eric/magicanimate 2. McDonald 2024 Generative AI roadmap - https://www.theverge.com/2023/12/6/23990900/mcdonalds-google-ai-cloud-generative 3. AI helps in understanding whale languages - https://osf.io/preprints/osf/285cs 4. AI networks are more vulnerable to malicious attacks than previously thought - https://www.sciencedaily.com/releases/2023/12/231204135128.htm 5. Blackrock to Roll Out Generative AI Tools Next Month - https://www.morningstar.com/news/dow-jones/202312065138/blackrock-to-roll-out-generative-ai-tools-next-month-ft-reports