Skip to content
View caipeng328's full-sized avatar

Block or report caipeng328

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
caipeng328/README.md

About Me:


I'm a Computer Vision Engineer.

Main Research Areas Intelligent Document/Document parse/3D Vision/Multimodal Large Language Models
Base: BeiJing

Papers

Open Work

Time Task Model Overview Case
2025 Text Translation from Captured Images Coming Soon This work extracts text from images and restores translations onto them, achieving high visual quality.
2025 Table Structure Recognition An effective table structure recognition model and dataset We provide an efficient table structure detection model that effectively handles complex cell merging and cell positioning. It supports output in HTML, JSON, and other custom formats. Demo
2025 OCR Detection and Handwritten Region Awareness Handwritten Awareness and Detection For specific needs, accurate identification of handwritten regions is essential. We offer a model that outputs both OCR detection boxes and a handwriting probability map. The training scripts and dataset will be open-sourced. Demo
2025 Lightweight Facial Landmark Detection Lite Face Keypoint detection We provide a lightweight facial landmark model under 5MB that outputs 106 keypoints. It is trained on six public datasets with enhanced alignment and loss balancing to support joint training across datasets of different scales. Demo
2025 Two LUT Reading Implementations: CPU (Eigen) and GPU (PyTorch) Versions OPENLUT Here are two implementations: one uses Eigen to quickly read LUT files with an average speed of about 0.5 seconds; the other is a GPU-accelerated version based on PyTorch, achieving an average speed of around 0.1 seconds. Demo

Popular repositories Loading

  1. ForCenNet ForCenNet Public

    81 3

  2. lite_face_kyepoint lite_face_kyepoint Public

    Python 18

  3. Handwritten-Awareness-and-Detection Handwritten-Awareness-and-Detection Public

    For specific needs, accurate identification of handwritten regions is essential. We offer a model that outputs both OCR detection boxes and a handwriting probability map. The training scripts and d…

    Python 13

  4. wired_table_rec wired_table_rec Public

    Recognition model for wireframed tables

    Python 8 1

  5. caipeng328 caipeng328 Public

    1

  6. OPENLUT OPENLUT Public

    C++ 1