kade.im
Easy Convert for NPU

Easy Convert for NPU

표시
Project Years
2024
Tags
NPU
AICHIP
AI
MLOPS
VisionAI
Skills
Python
NextJS
Typescript
k8s
Docker
notion image
notion image
 
You can try in below link ( It takes max 30 minutes in converting )

Introduce

  • This project’s main goal is making NPU usage more easily for client
    • client can know the process before using NPU for AI Service
  • Whole process can be done with web UI, server located in DUDAJI office
      1. Upload gpu trained model
      1. Test that model can be run on NPU
      1. Calibrate and Quantize by selected algorithm
      1. Download ENF converted Model
      1. Test directly on UI (Yolo, vision only)
 

Making EasyConvert v2

notion image
 
 

Key Learnings and Insights

This project focused on simplifying NPU adoption by providing an intuitive end-to-end model conversion workflow through a web-based UI.
  1. Developing a User-Friendly Interface for AI Model Conversion
      • Learned how to design a UI/UX that makes complex AI chip usage accessible to users with minimal technical expertise.
      • Focused on streamlining model upload, NPU compatibility testing, quantization, and conversion into an easy-to-follow web process.
  1. Bridging the Gap Between AI Users and NPU Technology
      • Realized the importance of educating clients on NPU workflows before they adopt the hardware for AI services.
      • Explored ways to improve visibility into the NPU conversion process, ensuring users understand each step before finalizing their models.
  1. Future Considerations for NPU Adoption
      • Gained insights into the challenges of wider NPU adoption, particularly in terms of model compatibility, calibration methods, and developer accessibility.
      • Recognized the need for better documentation, automated optimizations, and seamless integration to make NPUs more appealing to the broader AI community.
This project reinforced my understanding of how AI infrastructure should be designed for accessibility, and it provided valuable experience in developing user-friendly tools for AI hardware deployment.