Category

    Cohere AI Releases C4AI Command R+: An Open Weights Research Release of a 104B Parameter Model with Highly Advanced Capabilities Including Tools like RAG
    This AI Paper from China Proposes a Novel Architecture Named ViTAR (Vision Transformer with Any Resolution)

    The remarkable strides made by the Transformer architecture in Natural Language Processing (NLP) have ignited a surge of interest within …

    China plans to disrupt elections with AI-generated disinformation

    Beijing is expected to ramp up sophisticated AI-generated disinformation campaigns to influence several high-profile elections in 2024, according to Microsoft’s …

    Myshell AI and MIT Researchers Propose JetMoE-8B: A Super-Efficient LLM Model that Achieves LLaMA2-Level Training with Just US $0.1M

    In an era where artificial intelligence (AI) development often seems gated behind billion-dollar investments, a new breakthrough promises to democratize …

    Researchers at Google AI Innovate Privacy-Preserving Cascade Systems for Enhanced Machine Learning Model Performance

    The cascades concept has emerged as a critical mechanism, particularly for large language models (LLMs). These cascades enable a smaller, …

    Meet Atla: A Machine Learning Startup Building an AI Evaluation Model to Unlock the Full Potential of Language Models for Developers

    Large language models (LLMs) are driving innovation in the rapidly expanding artificial intelligence (AI) field. When creating new …

    EURUS: A Suite of Large Language Models (LLMs) Optimized for Reasoning, Achieving State-of-the-Art Results among Open-Source Models on Diverse Benchmarks
    Meet SWE-Agent: An Open-Source Software Engineering Agent that can Fix Bugs and Issues in GitHub Repositories

    Fixing bugs and issues in code repositories can be challenging in software engineering. Imagine encountering a bug in a GitHub …

    Researchers from ETH Zurich, EPFL, and Microsoft Introduce QuaRot: A Machine Learning Method that Enables 4-bit Inference of LLMs by Removing the Outlier Features

    Large language models (LLMs) have revolutionized various applications across industries by providing advanced natural language processing capabilities. These models’ ability …

    TFB: An Open-Source Machine Learning Library Designed for Time Series Researchers

    Robust benchmarks are indispensable tools in the arsenal of researchers, providing a rigorous framework for evaluating new methods across a …
