Close Menu
Şevket Ayaksız

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Save 45% on Anker’s Prime 6-in-1 USB-C Charger

    Mayıs 8, 2025

    Tariffs Force 8BitDo to Pause U.S. Deliveries

    Mayıs 8, 2025

    PC Manager App Now Displays Microsoft 365 Advertisements

    Mayıs 8, 2025
    Facebook X (Twitter) Instagram
    • software
    • Gadgets
    Facebook X (Twitter) Instagram
    Şevket AyaksızŞevket Ayaksız
    Subscribe
    • Home
    • Technology

      Ryzen 8000 HX Series Brings Affordable Power to Gaming Laptops

      Nisan 10, 2025

      Today only: Asus OLED laptop with 16GB RAM drops to $550

      Nisan 6, 2025

      Panther Lake: Intel’s Upcoming Hybrid Hero for PCs

      Nisan 5, 2025

      A new Xbox gaming handheld? Asus’ teaser video sparks speculation

      Nisan 2, 2025

      Now available—Coolify’s ‘holographic’ PC fans bring a unique visual effect

      Nisan 2, 2025
    • Adobe
    • Microsoft
    • java
    • Oracle
    Şevket Ayaksız
    Anasayfa » Deploying Deep Learning in Production: Achieving Multiple Efficiencies
    software

    Deploying Deep Learning in Production: Achieving Multiple Efficiencies

    By mustafa efeAğustos 3, 2024Yorum yapılmamış3 Mins Read
    Facebook Twitter Pinterest LinkedIn Tumblr Email
    Share
    Facebook Twitter LinkedIn Pinterest Email

    How TalkingData Uses AWS Open Source Deep Java Library with Apache Spark for Scalable Machine Learning Inference

    TalkingData is a leading data intelligence service provider, specializing in delivering actionable insights on consumer behavior, preferences, and trends. A core component of their offering is leveraging advanced machine learning and deep learning models to predict consumer behaviors. For instance, a car dealer might use these insights to target ads more effectively, focusing on potential buyers who are predicted to purchase a car within the next few months.

    Initially, TalkingData relied on an XGBoost model for such predictions. However, their data science team sought to explore whether deep learning models could deliver superior performance for their use case. After extensive experimentation, they developed a deep learning model using PyTorch, an open-source deep learning framework. This new model demonstrated a 13% improvement in recall rate, meaning it provided more accurate predictions while maintaining a consistent level of precision.

    Despite these improvements, deploying deep learning models at TalkingData’s scale presented significant challenges. The company needed to generate hundreds of millions of predictions daily, which required robust processing capabilities. Previously, they used Apache Spark, an open-source distributed processing engine, to manage large-scale data processing tasks. While Spark excels at distributing tasks across multiple instances for faster processing, it is a Java/Scala-based platform that can encounter issues when integrating with Python-based applications. Specifically, Spark’s Java garbage collector often struggles to manage memory usage effectively for Python programs, leading to potential crashes and inefficiencies.

    Although the XGBoost model had native support for Java, allowing TalkingData to deploy it directly within Spark, PyTorch did not offer a similar Java API. This lack of native support created a problem: TalkingData could not directly execute their PyTorch model within Apache Spark due to the aforementioned memory management issues. To address this, they had to transfer data from Spark to a separate GPU instance for model inference. This workaround not only increased the overall processing time but also added complexity and maintenance overhead.

     

     

    A breakthrough came when TalkingData’s production team learned about DJL (Deep Java Library) through the article “Implement Object Detection with PyTorch in Java in 5 Minutes with DJL.” DJL, an open-source deep learning framework developed by AWS, is designed to run deep learning models in Java. It supports various deep learning engines, including PyTorch, and provides a solution to integrate deep learning models with Java-based environments like Apache Spark.

    By adopting DJL, TalkingData was able to execute their PyTorch model directly within Apache Spark, eliminating the need for separate GPU instances. This integration streamlined their processing pipeline, resulting in a 66% reduction in running time and significant cuts in maintenance costs. DJL’s compatibility with Spark allowed TalkingData to optimize their deep learning deployment, achieving greater efficiency and performance.

    In summary, the use of DJL enabled TalkingData to overcome the challenges associated with deploying deep learning models at scale, integrating seamlessly with their existing Apache Spark infrastructure. This solution not only improved processing efficiency but also simplified maintenance, illustrating how advancements in technology can lead to substantial operational benefits.

    Post Views: 79
    java Programming Languages Software Development
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    mustafa efe
    • Website

    Related Posts

    PC Manager App Now Displays Microsoft 365 Advertisements

    Mayıs 8, 2025

    Microsoft Raises Xbox Series X Price by $100 Amid Global Adjustments

    Mayıs 8, 2025

    The Cot framework simplifies web development in Rust

    Nisan 29, 2025
    Add A Comment

    Comments are closed.

    Editors Picks
    8.5

    Apple Planning Big Mac Redesign and Half-Sized Old Mac

    Ocak 5, 2021

    Autonomous Driving Startup Attracts Chinese Investor

    Ocak 5, 2021

    Onboard Cameras Allow Disabled Quadcopters to Fly

    Ocak 5, 2021
    Top Reviews
    9.1

    Review: T-Mobile Winning 5G Race Around the World

    By sevketayaksiz
    8.9

    Samsung Galaxy S21 Ultra Review: the New King of Android Phones

    By sevketayaksiz
    8.9

    Xiaomi Mi 10: New Variant with Snapdragon 870 Review

    By sevketayaksiz
    Advertisement
    Demo
    Şevket Ayaksız
    Facebook X (Twitter) Instagram YouTube
    • Home
    • Adobe
    • microsoft
    • java
    • Oracle
    • Contact
    © 2025 Theme Designed by Şevket Ayaksız.

    Type above and press Enter to search. Press Esc to cancel.