Is there an available API that can be used to get a sense of voice recognition?

I’m exploring enabling speech-to-commands processing for a game, but would like to try and do a baseline of voice recognition within that to allow two people in close proximity to interact , but not interfere with each others voice commands to this system.

(it’s for an accessible game idea)