Microsoft - Creating a virtual stage when in-person isn’t possible

<div style="margin: 5px 5% 10px 5%;"><img src="https://www.sickgaming.net/blog/wp-content/uploads/2020/05/creating-a-virtual-stage-when-in-person-isnt-possible.png" width="1024" height="589" title="" alt="" /></div><div><div><img src="https://www.sickgaming.net/blog/wp-content/uploads/2020/05/creating-a-virtual-stage-when-in-person-isnt-possible.png" class="ff-og-image-inserted"></div>
The Azure Kinect camera captures depth information with infrared light, and that data helps make the AI model more accurate. We used an app called Speaker Recorder to manage two video signals from the Azure Kinect camera: the RGB signal and the depth signal. Once the recording was complete, the AI model was applied through a command line tool. To get the full details on how this all came together, check out the Microsoft AI Lab (https://www.microsoft.com/en-us/ai/ai-lab-virtual-stage).
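The post doesn’t include Speaker Recorder’s capture code, but as a rough sketch of grabbing those two signals, here is what a minimal capture loop looks like with the community pyk4a Python wrapper for the Azure Kinect SDK (the wrapper and the configuration values are assumptions, not the team’s actual tooling):

```python
# Minimal sketch: capture aligned RGB + depth frames from an Azure Kinect.
# Assumes the community "pyk4a" wrapper (pip install pyk4a) and an attached device.
import pyk4a
from pyk4a import Config, PyK4A

k4a = PyK4A(
    Config(
        color_resolution=pyk4a.ColorResolution.RES_1080P,
        depth_mode=pyk4a.DepthMode.NFOV_UNBINNED,
        synchronized_images_only=True,  # only return captures that carry both signals
    )
)
k4a.start()

capture = k4a.get_capture()
rgb = capture.color                # BGRA color frame, shape (1080, 1920, 4)
depth = capture.transformed_depth  # depth in millimeters, registered to the color image

k4a.stop()
```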
The AI model we used is based on work recently published by the University of Washington. In their research, the university developed a deep neural network that takes two images: one of the background alone and another with a person in front of it. The output of the neural network is a smooth transparency mask.
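The published model is a deep encoder-decoder; purely to make the two-inputs-in, one-mask-out idea concrete, here is a toy PyTorch sketch (every layer and size here is illustrative, not the UW architecture):

```python
# Illustrative sketch only: a tiny matting network that concatenates the person
# frame with the clean background frame and predicts a per-pixel transparency
# (alpha) mask in [0, 1].
import torch
import torch.nn as nn

class TinyMattingNet(nn.Module):
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(6, 32, 3, padding=1),  # 6 channels: RGB person + RGB background
            nn.ReLU(),
            nn.Conv2d(32, 32, 3, padding=1),
            nn.ReLU(),
            nn.Conv2d(32, 1, 3, padding=1),
            nn.Sigmoid(),                    # alpha in [0, 1], smooth rather than binary
        )

    def forward(self, person, background):
        return self.net(torch.cat([person, background], dim=1))

model = TinyMattingNet()
person = torch.rand(1, 3, 256, 256)      # frame with the speaker
background = torch.rand(1, 3, 256, 256)  # same shot, speaker absent
alpha = model(person, background)        # (1, 1, 256, 256) transparency mask
```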
This neural network was trained on images where the masking work was done manually. The UW researchers used a dataset provided by Adobe containing many images for which a designer had created the transparency mask by hand.
With this approach, the neural network can learn how to smooth areas like hair or loose clothing. However, there are some limitations: if the person is wearing something close in color to the background, the system renders it as holes in the image, which defeats the illusion.
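To see why those holes are so visible, note that the matte is applied with standard alpha compositing, so any region where alpha wrongly drops toward zero lets the virtual stage show through the person. A quick NumPy illustration (the array names and data are hypothetical):

```python
# Standard alpha compositing: out = alpha * person + (1 - alpha) * new_background.
# Wherever alpha is wrongly ~0 inside the person (e.g. a shirt that matches the
# original background), the virtual stage shows through as a "hole".
import numpy as np

h, w = 720, 1280
person = np.random.rand(h, w, 3)  # captured frame (placeholder data)
stage = np.random.rand(h, w, 3)   # virtual stage to composite onto
alpha = np.random.rand(h, w, 1)   # predicted transparency mask in [0, 1]

composite = alpha * person + (1.0 - alpha) * stage
```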
So, what the UW researchers did was combine this method with another: a second neural network tries to guess the contour just by looking at the image. In the case of our virtual stage, we know there is a person on screen, so the neural network tries to identify the silhouette of that person. Adding this second neural network eliminates the color transparency issue, but small details like hair or fingers can still be a problem.
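The post doesn’t name the segmentation model; as a stand-in, a pretrained person-segmentation network such as torchvision’s DeepLabV3 can produce the kind of silhouette described (the model choice is an assumption):

```python
# Sketch: extract a person silhouette from a single RGB frame with a pretrained
# DeepLabV3 model from torchvision. "Person" is class 15 in the Pascal VOC
# label set the model was trained on.
import torch
from torchvision import transforms
from torchvision.models.segmentation import deeplabv3_resnet50

# Newer torchvision versions prefer weights=... over pretrained=True.
model = deeplabv3_resnet50(pretrained=True).eval()

preprocess = transforms.Compose([
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406],
                         std=[0.229, 0.224, 0.225]),
])

def person_silhouette(pil_image):
    batch = preprocess(pil_image).unsqueeze(0)
    with torch.no_grad():
        logits = model(batch)["out"]      # (1, 21, H, W) per-class scores
    return (logits.argmax(dim=1) == 15)   # boolean person mask, (1, H, W)
```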
So, here’s the interesting part: the UW researchers created an architecture called Context Switching. Depending on the conditions, the system can pick whichever approach works best, getting the best of the two.
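The published Context Switching block is more elaborate than this, but as a simplified sketch of the idea, a small network can learn per-pixel weights that blend the background-difference matte with the segmentation silhouette (all names and layer sizes here are illustrative):

```python
# Simplified sketch of the context-switching idea: learn a per-pixel weight that
# decides, at each location, whether to trust the background-difference matte
# or the segmentation silhouette.
import torch
import torch.nn as nn

class MatteSwitcher(nn.Module):
    def __init__(self):
        super().__init__()
        # Looks at the image plus both candidate mattes and outputs a weight map.
        self.selector = nn.Sequential(
            nn.Conv2d(5, 16, 3, padding=1),  # 3 image channels + 2 candidate mattes
            nn.ReLU(),
            nn.Conv2d(16, 1, 1),
            nn.Sigmoid(),
        )

    def forward(self, image, matte_bg, matte_seg):
        w = self.selector(torch.cat([image, matte_bg, matte_seg], dim=1))
        return w * matte_bg + (1 - w) * matte_seg  # per-pixel blend of the two cues
```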
In our case, because we are using Azure Kinect, we can go a step further and replace the second neural network with the silhouette provided by the Kinect, which is much more accurate since it comes from the captured depth information.
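How the silhouette is derived from depth isn’t detailed in the post; one plausible sketch is to keep the depth band the speaker occupies and clean the mask up morphologically (the distance thresholds and OpenCV cleanup are assumptions):

```python
# Sketch: turn a Kinect depth frame (millimeters, registered to the color image)
# into a binary person silhouette by keeping a plausible speaker distance band,
# then removing speckle with a morphological open/close.
import cv2
import numpy as np

def depth_silhouette(depth_mm, near_mm=500, far_mm=2500):
    mask = ((depth_mm > near_mm) & (depth_mm < far_mm)).astype(np.uint8) * 255
    kernel = cv2.getStructuringElement(cv2.MORPH_ELLIPSE, (7, 7))
    mask = cv2.morphologyEx(mask, cv2.MORPH_OPEN, kernel)   # drop isolated specks
    mask = cv2.morphologyEx(mask, cv2.MORPH_CLOSE, kernel)  # fill small holes
    return mask  # uint8 silhouette: 255 = person, 0 = background
```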
The model is improved even more with another AI technique called an adversarial network. We connect the output of our neural network to a second neural network, a discriminator, that tries to tell whether an image is real or fake. Training against it pushes the original neural network to make small adjustments that fool the discriminator, and the result is a network that produces even more natural images.
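The post doesn’t include the adversarial training setup; the sketch below shows the generic pattern it describes, with a small discriminator judging composites and the matting network updated to fool it (all module and variable names are hypothetical):

```python
# Generic adversarial refinement step (illustrative, not the team's code):
# a discriminator D learns to tell real frames from composites, and the
# matting network G is nudged to produce composites that D classifies as real.
import torch
import torch.nn as nn

D = nn.Sequential(  # tiny patch discriminator over RGB composites
    nn.Conv2d(3, 32, 4, stride=2, padding=1), nn.LeakyReLU(0.2),
    nn.Conv2d(32, 1, 4, stride=2, padding=1),
)
bce = nn.BCEWithLogitsLoss()

def adversarial_step(G, d_opt, g_opt, person, background, stage, real_frames):
    # 1) Train the discriminator: real frames -> "real", composites -> "fake".
    alpha = G(person, background)
    fake = alpha * person + (1 - alpha) * stage
    real_logits = D(real_frames)
    fake_logits = D(fake.detach())
    d_loss = (bce(real_logits, torch.ones_like(real_logits)) +
              bce(fake_logits, torch.zeros_like(fake_logits)))
    d_opt.zero_grad(); d_loss.backward(); d_opt.step()

    # 2) Train the matting network: small adjustments so the composite fools D.
    fake_logits = D(fake)
    g_loss = bce(fake_logits, torch.ones_like(fake_logits))
    g_opt.zero_grad(); g_loss.backward(); g_opt.step()
```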

