To those who are left behind in the AI rush

Ever feel overwhelmed by AI posts on every social media and professional platform? And don’t understand the AI jargon, buzzwords, and concepts being discussed? I am one of them, and here is a cheat sheet summarizing the key points to help you get up to speed.

Core Concepts

  • NLP (Natural Language Processing): The broad field of AI that teaches computers to understand, interpret, and generate human language.
  • LLM (Large Language Model): A type of AI trained on massive amounts of text to predict the next word in a sequence, allowing it to chat, write, and summarize.
  • LMM (Large Multimodal Model): An advanced version of an LLM that can understand more than just text, such as images, audio, and video.
  • Foundational Model: A massive, “general purpose” AI model (like GPT-4) trained on huge data that serves as a base to be adapted for many different specific tasks.
  • Multimodal: The ability of an AI to process and “connect” different types of data (modes) at once, like seeing an image and describing it in text.

Practical Techniques

  • Prompt Engineering: The art of crafting specific instructions or “prompts” to get the best possible answer from an AI model.
  • Finetune: Taking a pre-trained model and giving it extra training on a smaller, specific dataset to make it an expert in a particular subject or style.
  • RAG (Retrieval-Augmented Generation): Connecting an AI to a trusted external database (like your company’s files) so it can look up real-time facts before answering, reducing “hallucinations”.
  • Embedding Model: A tool that converts text or images into a list of numbers (vectors) that represent their meaning, allowing computers to compare how “similar” two ideas are.

Advanced AI & Engineering

  • AI Engineering: The practice of building complete applications using ready-made AI models (like foundation models). Instead of training a model from scratch, an AI engineer focuses on integrating these models into software, ensuring they are secure, cost-effective, and useful for the user.
  • ML Engineering (Machine Learning Engineering): A specialized role focused on creating and improving the models themselves. This involves training algorithms from scratch, cleaning data, and optimizing mathematical performance.
  • Agentic AI: AI that doesn’t just talk but can act. It can break a big goal into steps, use tools (like searching the web or booking a flight), and complete tasks autonomously.
  • MLOps (Machine Learning Operations): The “factory” side of AI; it’s a set of practices to ensure AI models are updated, monitored, and running reliably in the real world.

How the Roles Work Together

In a real-world project (like building a customer support agent), the roles collaborate in a “pipeline”:

  • ML Engineer: Designs and trains the “brain.” They might build a custom embedding model from scratch so the company’s specific technical jargon is understood perfectly.
  • AI Engineer: Connects the brain to the world. They take a foundational model (like GPT-4), set up RAG so it can read private manuals, and use prompt engineering to ensure it stays polite.
  • MLOps Engineer: Builds the “factory.” they ensure that if 10,000 customers use the app at once, the servers don’t crash, and they monitor the model to make sure it doesn’t start giving wrong answers over time.

Essential AI Engineering Tools

AI Engineers today rely on a specific stack to move fast without building everything from scratch:

  • Orchestration (Linking things together): LangChain and LlamaIndex are the “glue” used to build complex workflows and Agentic AI.
  • Vector Databases (For RAG): Pinecone, Weaviate, and Chroma store the embeddings that allow an AI to “search” through your documents.
  • App Builders & UI: Streamlit and Gradio are used to quickly create chat interfaces or dashboards for the AI.
  • Evaluation & Observability: LangSmith and Arize Phoenix help engineers “see” into the AI’s thoughts to debug why it might be hallucinating.
  • Local Development: Ollama and LM Studio allow engineers to run powerful LLMs directly on their own laptops for private testing

Catching up on the tools and models in the industry

To help you catch up on the rapid evolution of AI, here is a breakdown of the leading tools and models currently dominating the industry.

  1. Foundational Models:
    These are general-purpose engines used as the basis for other applications.

    • Text & General Reasoning: OpenAI’s GPT-4o, Anthropic’s Claude 3.5 Sonnet, Google’s Gemini 1.5 Pro, and Meta’s Llama 3 are examples of text and general reasoning models. Llama 3 is a leader in open-source models.
    • Image Generation: Stable Diffusion and Midjourney are examples of image generation models.
    • Video Generation: Synthesia (for talking heads) and Google Veo (for cinematic clips) are used for video generation.
  2. AI Applications & Assistants:
    These are ready-to-use products designed to improve productivity.

    • Coding Assistants: GitHub Copilot is a standard tool for real-time coding assistance.
    • Meeting Automation: Fireflies.ai records, transcribes, and summarizes meetings automatically.
    • Search & Research: Perplexity AI is a conversational search engine that cites sources.
    • Workflow Assistants: Lindy can execute multi-step business workflows across different apps.
  3. RAG Tools:
    These tools help AI interact with private data.

    • Frameworks: LlamaIndex specializes in indexing and retrieving data. LangChain is popular for building application logic.
    • Vector Databases: Pinecone (managed cloud), Weaviate (open-source), and Chroma store the “mathematical meaning” of documents for search.
    • Evaluation: LangSmith and Arize Phoenix help with debugging AI responses.
  4. Agentic AI:
    These frameworks enable AI to perform autonomous tasks.

    • Multi-Agent Teams: CrewAI and Microsoft’s AutoGen allow the creation of AI agent teams with specific roles.
    • Complex Workflows: LangGraph provides control over how an agent performs a task, including human-approval steps.
    • Autonomous Coding: Goose and Claude Code can edit files and run tests.
  5. Other Important Engineering Tools:

    • Local AI Runners: Ollama is a tool for running open-source models like Llama 3 on a personal computer.
    • Model Optimization: DSPy optimizes prompts automatically.
Total
0
Shares
Leave a Reply

Your email address will not be published. Required fields are marked *

Previous Post

Snowflake Drawing Tool

Next Post

Prometheus #1

Related Posts