What do the “facial transformation matrix” and “canonical face model” when extracting 3D face poses from MediaPipe?
I’m currently playing around with the MediaPipe face detection tool from Google. I am using it to extract 3D poses from an image containing a face. I’m a little confused about the returned outputs from mediapipe’s detector once run on an image. In particular, the detector returns normalized keypoints (I think I understand this fine, it’s a 478×3 tensor representing the keypoints on the face), face_blendshapes (not too concerned with this atm) and a facial transformation matrix.