Skip to content
View everglow01's full-sized avatar
🤺
Focusing
🤺
Focusing
  • everglow818@outlook.com
  • Hunan Changsha
  • 02:36 (UTC +08:00)

Block or report everglow01

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
everglow01/README.md

Hi, I'm Owen / everglow01 👋

Computer Vision · VLM / VLA · Practical AI Systems

Typing SVG

Blog Email GitHub


👨‍💻 About me

I'm a computer vision learner and builder, currently focusing on AI scene understanding for real-world camera systems. I care about both model performance and whether a vision system can actually work reliably in messy, practical environments.

I have hands-on experience with core vision tasks such as object detection, instance segmentation, visual perception, and scene understanding. I'm also interested in VLMs, VLA models, and end-to-end vision-language-action systems, especially how frontier models connect perception, reasoning, and interaction. I enjoy turning research ideas into usable prototypes and tools.


🔭 Current interests

  • Visual AI systems for cameras and surveillance scenarios
  • Object detection, segmentation, visual perception, and scene reasoning
  • VLM / VLA applications, multimodal agents, and end-to-end perception-to-action models
  • CNN / Transformer / vision foundation model applications
  • PyTorch training & inference, OpenCV pipelines, and desktop AI tools
  • Building clean demos, evaluation scripts, and practical engineering workflows

🧰 Tech stack

Skill icons

VLM VLA Vision Foundation Models Multimodal AI


🚀 Selected projects

  • Windows-Face-Hello — RGB webcam face unlock experiment for Windows, combining face recognition, liveness detection, and system integration.
  • Owen-Studio — My technical blog for computer vision and deep learning notes.
  • VIT — Vision model experiments covering classification, detection, segmentation, and training workflows.
  • lane-vehicle-counter — Lane-level vehicle detection and counting based on OpenCV.
  • VLM-Workbench — A desktop prototype for exploring structured visual understanding with VLMs.

📊 GitHub stats

GitHub stats GitHub Streak
Top languages
GitHub Activity Graph

Thanks for visiting — feel free to explore my projects or reach me by email. 🚀

Pinned Loading

  1. VIT VIT Public

    A unified ViT/Swin computer vision framework for image classification, object detection, and instance segmentation, with COCO support, transfer learning, and auto experiment management.

    Python 5 1

  2. Windows-Face-Hello Windows-Face-Hello Public

    Face unlock for Windows using an ordinary RGB webcam — a C++ Credential Provider + LocalSystem service with InsightFace recognition and MediaPipe liveness, for cameras Windows Hello doesn't support.

    Python 2

  3. Owen-Studio Owen-Studio Public

    个人技术博客,聚焦计算机视觉与深度学习。在线地址:https://everglow01.github.io/Owen-Studio/

    HTML 1

  4. Attention_Module Attention_Module Public

    Modular PyTorch implementations of popular attention mechanisms (SE, CBAM, ECA, CA, BAM, SimAM, and more) designed for easy integration into any CNN or Transformer model.

    Python 3