It is better to use APIs provided by stable large companies, without keys, directly get the text submitted via GET and return an audio file, which is convenient for putting into the front-end tag directly using. For the English tool I made before:
Learn 100 sentences in 7000 words of雅思 vocabulary