
Commit 7f3ff4a

Merge pull request #9 from ryanontheinside/feat/mediapipe-vision
Feat/mediapipe vision
2 parents 97d028b + de7d9c9 commit 7f3ff4a

File tree: 111 files changed, +12125 −41 lines


README.MD

Lines changed: 100 additions & 17 deletions
@@ -4,7 +4,8 @@ A growing suite of nodes for real-time ComfyUI workflows. Features include value
 
 The intention for this repository is to build a suite of nodes that can be used in the burgeoning real-time diffusion space. Contributions are welcome!
 
-## Nodes
+
+## Control Nodes
 
 ### Value Controls 🎚️
 - **FloatControl**: Outputs a floating point value that changes over time using various patterns (sine wave, bounce, random walk, etc).
@@ -27,17 +28,11 @@ The intention for this repository is to build a suite of nodes that can be used
 - **DTypeConverter**: Convert masks between different data types (float16, uint8, float32, float64).
 - **FastWebcamCapture**: High-performance webcam capture node with resizing capabilities.
 - **SimilarityFilter**: Filter out similar consecutive images and control downstream execution. Perfect for optimizing real-time workflows by skipping redundant processing of similar frames.
+
+### Logic 🧠
 - **LazyCondition**: Powerful conditional execution node that supports any input type. Uses lazy evaluation to truly skip execution of unused paths and maintains state to avoid feedback loops.
 
-## Movement Patterns 🔄
 
-All value and motion controls support various movement patterns:
-- **Sine**: Smooth sinusoidal motion
-- **Triangle**: Linear interpolation with smooth direction changes
-- **Sawtooth**: Linear interpolation with sharp resets
-- **Square**: Instant transitions between min/max values
-- **Static**: No movement (constant value)
-- **and more**
 
 ## Usage 📖
 
@@ -65,6 +60,74 @@ Use utility nodes to optimize and control your workflow:
 - **SimilarityFilter**: Skip processing of similar frames by comparing consecutive images. Great for optimizing real-time workflows by only processing frames that have meaningful changes.
 - **LazyCondition**: Create conditional execution paths that truly skip processing of unused branches. Works with any input type (images, latents, text, numbers) and maintains state of the last successful output to avoid feedback loops.
 
+## 🔮 MediaPipe Vision
+
+### ✨ Overview
+
+This repository provides a complete implementation of Google MediaPipe vision tasks for ComfyUI. It enables computer vision capabilities that can be used for interactive AI art, responsive interfaces, motion tracking, and advanced masking workflows.
+
+### 🚀 Features
+
+| Category | Available Tools |
+|----------|-----------------|
+| **Face Analysis** | Face detection, face mesh (478 points), blendshapes, head pose |
+| **Body Tracking** | Pose estimation (33 landmarks), segmentation masks |
+| **Hand Analysis** | Hand tracking (21 landmarks per hand), gesture recognition |
+| **Image Processing** | Object detection, image segmentation, image embeddings |
+| **Creative Tools** | Face stylization, interactive segmentation |
+
+### 📋 Supported MediaPipe Tasks
+
+* **Face Detection:** Face bounding boxes and keypoints
+* **Face Landmark Detection:** Face mesh landmarks with expression analysis
+* **Hand Landmark Detection:** Hand position tracking with 21 landmarks
+* **Pose Landmark Detection:** Body pose tracking with 33 landmarks
+* **Object Detection:** Common object detection using models like EfficientDet
+* **Image Segmentation:** Category-based image segmentation
+* **Gesture Recognition:** Recognition of common hand gestures
+* **Image Embedding:** Feature vector generation for image similarity
+* **Interactive Segmentation:** User-guided image masking
+* **Face Stylization:** Artistic style application to faces
+* **Holistic Landmark Detection:** Full-body landmark detection (legacy)
+
+> **Note:** Holistic landmark detection uses the legacy MediaPipe API as we await the official Tasks API release.
+
+### ⚙️ Landmark System
+
+The project's landmark system allows extracting and using position data:
+
+#### Position Extraction
+
+**Landmark Position Extractors** access coordinate data from any landmark:
+- Extract x, y, z positions from face, hand, or pose landmarks
+- Access visibility and presence information where available
+- Access world coordinates when available (hand and pose)
+- Input landmark indices directly to access any point
+- Process batches for multi-frame workflows
+
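The position-extraction idea above can be sketched in a few lines of plain Python. The `Landmark` container below is a hypothetical stand-in for MediaPipe's normalized landmarks (x, y, z in 0-1 image coordinates); it is not the node implementation.

```python
from dataclasses import dataclass

@dataclass
class Landmark:
    """Hypothetical stand-in for one MediaPipe normalized landmark."""
    x: float
    y: float
    z: float
    visibility: float = 1.0

def extract_position(landmarks, index, image_width, image_height):
    """Return pixel-space (x, y) and the raw z for one landmark index."""
    lm = landmarks[index]
    return (lm.x * image_width, lm.y * image_height, lm.z)

# Example: landmark index 1 on a 512x512 frame.
face = [Landmark(0.5, 0.5, 0.0), Landmark(0.25, 0.75, -0.01)]
print(extract_position(face, 1, 512, 512))  # → (128.0, 384.0, -0.01)
```

The same lookup generalizes to hand and pose landmark lists; batch processing is just this function mapped over frames.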
+#### Position Processing
+
+Several node types work with landmark position data:
+
+- **Delta Controls** - Track movement and map changes to parameter values
+- **Proximity Nodes** - Calculate distances between landmarks
+- **Masking Nodes** - Generate masks centered at landmark positions
+- **Head Pose Extraction** - Calculate yaw, pitch, roll from face landmarks
+- **Blendshape Analysis** - Extract facial expression parameters
+
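The first two node types in the list reduce to simple geometry. A minimal sketch, assuming Euclidean distance for proximity and frame-to-frame distance for deltas (class and function names are illustrative, not the actual node classes):

```python
import math

def landmark_distance(a, b):
    """Euclidean distance between two (x, y, z) landmark positions."""
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(a, b)))

class DeltaControl:
    """Track frame-to-frame movement of one landmark as a scalar value."""
    def __init__(self):
        self.last = None

    def update(self, position):
        delta = 0.0 if self.last is None else landmark_distance(position, self.last)
        self.last = position
        return delta

ctrl = DeltaControl()
ctrl.update((0.5, 0.5, 0.0))          # first frame: no previous position
print(ctrl.update((0.5, 0.46, 0.0)))  # distance moved between frames (≈0.04)
```

A proximity node would call `landmark_distance` on two landmarks from the same frame (e.g. thumb tip vs. index tip for a pinch gesture), while a delta control feeds successive positions of one landmark through `update`.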
+### Example Workflow
+
+```
+Load Face Landmarker → Face Landmarker ← Image Input
+            |
+            ↓ landmarks
+Face Landmark Position (Index: 1) → x,y,z coordinates
+            |
+            ↓ x,y,z
+Position Delta Float Control → value → ComfyUI Parameter
+```
+
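The diagram's three stages can be simulated with plain Python to show how data flows from landmarks to a parameter. The landmark index and the 0-1 coordinate space follow MediaPipe conventions; the function names and the scale factor are illustrative assumptions, not the actual node API.

```python
def face_landmark_position(landmarks, index=1):
    """Stage 1: pick one landmark's (x, y, z) from the detected set."""
    return landmarks[index]

def position_delta(prev, cur):
    """Stage 2: per-axis change between two frames."""
    return tuple(c - p for p, c in zip(prev, cur))

def to_parameter(delta_y, scale=10.0):
    """Stage 3: map the vertical delta into a clamped 0-1 parameter."""
    return max(0.0, min(1.0, 0.5 + delta_y * scale))

frame1 = {1: (0.50, 0.40, 0.0)}
frame2 = {1: (0.50, 0.43, 0.0)}
dx, dy, dz = position_delta(face_landmark_position(frame1),
                            face_landmark_position(frame2))
print(to_parameter(dy))  # ≈ 0.8: downward motion pushes the parameter up
```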
 ## Examples 🎬
 
 ### Value Control Demo
@@ -97,6 +160,7 @@ git clone https://github.com/ryanontheinside/ComfyUI_RealTimeNodes
 cd ComfyUI_RealTimeNodes
 pip install -r requirements.txt
 ```
+> **Note:** For MediaPipe, GPU support varies by platform. For Linux, see [these instructions](https://ai.google.dev/edge/mediapipe/framework/getting_started/gpu_support).
 
 ## Coming Soon 🚀
 
@@ -116,19 +180,37 @@ This is an evolving project that aims to expand the real-time capabilities of Co
 
 ### Contributing 🤝
 
-Your feedback and contributions are more than welcome! This project grows stronger with community input.
+This project provides flexible infrastructure for computer vision in ComfyUI. If you have ideas for:
+
+- Creative AI interactions using vision
+- Specific landmark tracking or detection needs
+- Real-time vision workflows
+- Improvements to the current implementation
+
+Please open an issue, even if you're not sure how to implement it.
+
+The aim is to **iterate quickly** to keep up with this burgeoning field of real-time ComfyUI.
 
-- Have an idea? Open an issue! 💡
-- Found a bug? Open an issue! 🐛
-- Made an improvement? Submit a PR! 🎉
-- Want to help? Join the discussion! 💬
 
 Please visit our [GitHub Issues](https://github.com/ryanontheinside/ComfyUI_RealTimeNodes/issues) page to contribute.
 
 ## Related Projects 🔗
 
-### ComfyUI_RyanOnTheInside - Everything Reactivity ⚡
-Make anything react to anything in your ComfyUI workflows. [ComfyUI_RyanOnTheInside](https://github.com/ryanontheinside/ComfyUI_RyanOnTheInside) - my main custom nodes suite that brings complete reactive control to standard ComfyUI workflows:
+## 🔗 Related Projects
+
+### [ComfyUI_ControlFreak](https://github.com/ryanontheinside/ComfyUI_ControlFreak)
+Universal MIDI & Gamepad Mapping in ComfyUI. Map any MIDI controller or gamepad to any parameter in your ComfyUI workflow for intuitive, hands-on control of your generative art. Perfect for live performances, interactive installations, and streamlined creative workflows.
+
+### [comfystream](https://github.com/yondonfu/comfystream)
+A real-time streaming framework for ComfyUI that enables running workflows continuously on video streams, perfect for combining with MediaPipe vision capabilities.
+
+### [ComfyUI-Stream-Pack](https://github.com/livepeer/ComfyUI-Stream-Pack)
+A collection of ComfyUI nodes for multimedia streaming applications. Combines video processing with generative models for real-time media effects.
+
+### [ComfyUI_RyanOnTheInside](https://github.com/ryanontheinside/ComfyUI_RyanOnTheInside) - Everything Reactivity ⚡
+Make anything react to anything in your ComfyUI workflows. My main custom nodes suite brings complete reactive control to standard ComfyUI workflows:
 
 - Dynamic node relationships
 - React to audio, MIDI, motion, time, depth, color, Whisper, and more
@@ -143,4 +225,5 @@ Make anything react to anything in your ComfyUI workflows. [ComfyUI_RyanOnTheIns
 - Reactive DepthFlow
 - Actually more
 
-Use it alongside these Control Nodes to master parameter control in both the batch and real-time paradigms in ComfyUI! The POWER!!
+Use it alongside these Control Nodes to master parameter control in both the batch and real-time paradigms in ComfyUI! The POWER!!
+
examples/broccoli.png

383 KB

examples/hand_tracking_mask_resizer.json renamed to examples/control_nodes/hand_tracking_mask_resizer.json

Lines changed: 1 addition & 1 deletion
@@ -99,7 +99,7 @@
     },
     "23": {
       "inputs": {
-        "image": "dead_inside_512.png",
+        "image": "harold.png",
         "upload": "image"
       },
      "class_type": "LoadImage",

examples/mask_string.json renamed to examples/control_nodes/mask_string.json

Lines changed: 1 addition & 1 deletion
@@ -1,7 +1,7 @@
 {
   "57": {
     "inputs": {
-      "image": "dead_inside_512.png",
+      "image": "harold.png",
       "upload": "image"
     },
     "class_type": "LoadImage"

examples/motioncontrol_example_API.json renamed to examples/control_nodes/motioncontrol_example_API.json

Lines changed: 1 addition & 1 deletion
@@ -207,7 +207,7 @@
   },
   "39": {
     "inputs": {
-      "image": "dead_inside_512.png",
+      "image": "harold.png",
       "upload": "image"
     },
     "class_type": "LoadImage",

examples/dead_inside_512.png

-267 KB
Binary file not shown.

examples/harold.png

288 KB
Lines changed: 191 additions & 0 deletions
@@ -0,0 +1,191 @@
{
  "1": {
    "inputs": {
      "image": "broccoli.png",
      "upload": "image"
    },
    "class_type": "LoadImage",
    "_meta": {
      "title": "Load Image"
    }
  },
  "3": {
    "inputs": {
      "num_faces": 1,
      "min_face_detection_confidence": 0.5,
      "min_face_presence_confidence": 0.5,
      "min_tracking_confidence": 0.5,
      "output_blendshapes": true,
      "output_transform_matrix": true,
      "running_mode": "video",
      "delegate": "cpu",
      "image": [
        "6",
        0
      ],
      "model_info": [
        "4",
        0
      ]
    },
    "class_type": "MediaPipeFaceLandmarker",
    "_meta": {
      "title": "Face Landmarker (MediaPipe)"
    }
  },
  "4": {
    "inputs": {
      "model_variant": "default"
    },
    "class_type": "MediaPipeFaceLandmarkerModelLoader",
    "_meta": {
      "title": "Load Face Landmarker Model (MediaPipe)"
    }
  },
  "6": {
    "inputs": {
      "image": "harold.png",
      "upload": "image"
    },
    "class_type": "PrimaryInputLoadImage",
    "_meta": {
      "title": "PrimaryInputLoadImage"
    }
  },
  "7": {
    "inputs": {
      "x": 0,
      "y": 0,
      "resize_source": false,
      "destination": [
        "6",
        0
      ],
      "source": [
        "1",
        0
      ],
      "mask": [
        "18",
        0
      ]
    },
    "class_type": "ImageCompositeMasked",
    "_meta": {
      "title": "ImageCompositeMasked"
    }
  },
  "8": {
    "inputs": {
      "value": [
        "21",
        0
      ],
      "width": 512,
      "height": 512
    },
    "class_type": "SolidMask",
    "_meta": {
      "title": "SolidMask"
    }
  },
  "15": {
    "inputs": {
      "part_name": "FACE_OVAL",
      "face_landmarks": [
        "16",
        0
      ],
      "image_for_dimensions": [
        "6",
        0
      ]
    },
    "class_type": "MaskFromFaceLandmarks",
    "_meta": {
      "title": "Mask From Face Landmarks (MediaPipe)"
    }
  },
  "16": {
    "inputs": {
      "num_faces": 1,
      "min_face_detection_confidence": 0.5,
      "min_face_presence_confidence": 0.5,
      "min_tracking_confidence": 0.5,
      "output_blendshapes": true,
      "output_transform_matrix": true,
      "running_mode": "video",
      "delegate": "cpu",
      "image": [
        "6",
        0
      ],
      "model_info": [
        "17",
        0
      ]
    },
    "class_type": "MediaPipeFaceLandmarker",
    "_meta": {
      "title": "Face Landmarker (MediaPipe)"
    }
  },
  "17": {
    "inputs": {
      "model_variant": "default"
    },
    "class_type": "MediaPipeFaceLandmarkerModelLoader",
    "_meta": {
      "title": "Load Face Landmarker Model (MediaPipe)"
    }
  },
  "18": {
    "inputs": {
      "x": 0,
      "y": 0,
      "operation": "subtract",
      "destination": [
        "8",
        0
      ],
      "source": [
        "15",
        0
      ]
    },
    "class_type": "MaskComposite",
    "_meta": {
      "title": "MaskComposite"
    }
  },
  "21": {
    "inputs": {
      "blendshape_name": "jawOpen",
      "score_min": 0,
      "score_max": 1,
      "output_min_float": 0,
      "output_max_float": 1,
      "clamp": true,
      "blendshapes": [
        "3",
        1
      ]
    },
    "class_type": "BlendshapeControlFloat",
    "_meta": {
      "title": "Blendshape Control (Float)"
    }
  },
  "25": {
    "inputs": {
      "images": [
        "7",
        0
      ]
    },
    "class_type": "PreviewImage",
    "_meta": {
      "title": "Preview Image"
    }
  }
}
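In this workflow, node 21 (`BlendshapeControlFloat`) maps the `jawOpen` blendshape score into a float that drives the `SolidMask` value. A minimal sketch of that remapping, assuming linear interpolation with optional clamping, matching the node's `score_min`/`score_max`/`output_min_float`/`output_max_float`/`clamp` inputs (the node's exact semantics may differ):

```python
def blendshape_to_float(score, score_min=0.0, score_max=1.0,
                        output_min=0.0, output_max=1.0, clamp=True):
    """Remap a blendshape score into an output range, as node 21 configures."""
    t = (score - score_min) / (score_max - score_min)
    if clamp:
        t = max(0.0, min(1.0, t))  # keep the control inside the output range
    return output_min + t * (output_max - output_min)

# A half-open jaw drives the SolidMask value to 0.5.
print(blendshape_to_float(0.5))  # → 0.5
```

With the defaults used in this workflow the mapping is the identity on [0, 1], but widening `output_min_float`/`output_max_float` would let the same expression drive any parameter range.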

0 commit comments