Commit c56647a

feat(texttospeech): update the api
#### texttospeech:v1

The following keys were added:
- schemas.SynthesisInput.properties.markup.type (Total Keys: 1)

#### texttospeech:v1beta1

The following keys were added:
- schemas.SynthesisInput.properties.markup.type (Total Keys: 1)

1 parent 96fc44c commit c56647a
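
The commit message lists added keys as dotted paths into the discovery document. As a rough illustration of how such a list could be derived from two revisions of a document, here is a minimal sketch. The helpers `flatten_keys` and `added_keys` are hypothetical and are not the tool that generated this message; skipping `description` keys is an assumption made only so the output resembles the message above.

```python
def flatten_keys(node, prefix="", ignore=("description",)):
    """Yield dotted paths to every non-dict leaf of a nested dict.

    Hypothetical helper. Keys named in `ignore` are skipped (an assumption).
    """
    for key, value in node.items():
        if key in ignore:
            continue
        path = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            yield from flatten_keys(value, path, ignore)
        else:
            yield path


def added_keys(old, new):
    """Return the sorted dotted paths present in `new` but not in `old`."""
    return sorted(set(flatten_keys(new)) - set(flatten_keys(old)))


# Trimmed-down stand-ins for two revisions of the discovery document.
old_doc = {"schemas": {"SynthesisInput": {"properties": {
    "text": {"type": "string"},
}}}}
new_doc = {"schemas": {"SynthesisInput": {"properties": {
    "text": {"type": "string"},
    "markup": {
        "description": "Markup for HD voices specifically.",
        "type": "string",
    },
}}}}

result = added_keys(old_doc, new_doc)
print(result)
```

Because list values such as `enum` are treated as leaves here, appending entries to an existing list does not show up as an added key in this sketch.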

6 files changed, +26 -6 lines changed

docs/dyn/texttospeech_v1.projects.locations.html

Lines changed: 1 addition & 0 deletions
@@ -121,6 +121,7 @@ <h3>Method Details</h3>
 },
 ],
 },
+&quot;markup&quot;: &quot;A String&quot;, # Markup for HD voices specifically. This field may not be used with any other voices.
 &quot;multiSpeakerMarkup&quot;: { # A collection of turns for multi-speaker synthesis. # The multi-speaker input to be synthesized. Only applicable for multi-speaker synthesis.
 &quot;turns&quot;: [ # Required. Speaker turns.
 { # A multi-speaker turn.

docs/dyn/texttospeech_v1.text.html

Lines changed: 1 addition & 0 deletions
@@ -118,6 +118,7 @@ <h3>Method Details</h3>
 },
 ],
 },
+&quot;markup&quot;: &quot;A String&quot;, # Markup for HD voices specifically. This field may not be used with any other voices.
 &quot;multiSpeakerMarkup&quot;: { # A collection of turns for multi-speaker synthesis. # The multi-speaker input to be synthesized. Only applicable for multi-speaker synthesis.
 &quot;turns&quot;: [ # Required. Speaker turns.
 { # A multi-speaker turn.

docs/dyn/texttospeech_v1beta1.projects.locations.html

Lines changed: 1 addition & 0 deletions
@@ -121,6 +121,7 @@ <h3>Method Details</h3>
 },
 ],
 },
+&quot;markup&quot;: &quot;A String&quot;, # Markup for HD voices specifically. This field may not be used with any other voices.
 &quot;multiSpeakerMarkup&quot;: { # A collection of turns for multi-speaker synthesis. # The multi-speaker input to be synthesized. Only applicable for multi-speaker synthesis.
 &quot;turns&quot;: [ # Required. Speaker turns.
 { # A multi-speaker turn.

docs/dyn/texttospeech_v1beta1.text.html

Lines changed: 1 addition & 0 deletions
@@ -121,6 +121,7 @@ <h3>Method Details</h3>
 },
 ],
 },
+&quot;markup&quot;: &quot;A String&quot;, # Markup for HD voices specifically. This field may not be used with any other voices.
 &quot;multiSpeakerMarkup&quot;: { # A collection of turns for multi-speaker synthesis. # The multi-speaker input to be synthesized. Only applicable for multi-speaker synthesis.
 &quot;turns&quot;: [ # Required. Speaker turns.
 { # A multi-speaker turn.

googleapiclient/discovery_cache/documents/texttospeech.v1.json

Lines changed: 11 additions & 3 deletions
@@ -318,7 +318,7 @@
 }
 }
 },
-"revision": "20250415",
+"revision": "20250424",
 "rootUrl": "https://texttospeech.googleapis.com/",
 "schemas": {
 "AdvancedVoiceOptions": {
@@ -403,12 +403,16 @@
 "enum": [
 "PHONETIC_ENCODING_UNSPECIFIED",
 "PHONETIC_ENCODING_IPA",
-"PHONETIC_ENCODING_X_SAMPA"
+"PHONETIC_ENCODING_X_SAMPA",
+"PHONETIC_ENCODING_JAPANESE_YOMIGANA",
+"PHONETIC_ENCODING_PINYIN"
 ],
 "enumDescriptions": [
 "Not specified.",
 "IPA, such as apple -> \u02c8\u00e6p\u0259l. https://en.wikipedia.org/wiki/International_Phonetic_Alphabet",
-"X-SAMPA, such as apple -> \"{p@l\". https://en.wikipedia.org/wiki/X-SAMPA"
+"X-SAMPA, such as apple -> \"{p@l\". https://en.wikipedia.org/wiki/X-SAMPA",
+"For reading-to-pron conversion to work well, the `pronunciation` field should only contain Kanji, Hiragana, and Katakana. The pronunciation can also contain pitch accents. The start of a pitch phrase is specified with `^` and the down-pitch position is specified with `!`, for example: phrase:\u7aef pronunciation:^\u306f\u3057 phrase:\u7bb8 pronunciation:^\u306f!\u3057 phrase:\u6a4b pronunciation:^\u306f\u3057! We currently only support the Tokyo dialect, which allows at most one down-pitch per phrase (i.e. at most one `!` between `^`).",
+"Used to specify pronunciations for Mandarin words. See https://en.wikipedia.org/wiki/Pinyin. For example: \u671d\u9633, the pronunciation is \"chao2 yang2\". The number represents the tone, and there is a space between syllables. Neutral tones are represented by 5, for example \u5b69\u5b50 \"hai2 zi5\"."
 ],
 "type": "string"
 },
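
The `PHONETIC_ENCODING_JAPANESE_YOMIGANA` description states that a pitch phrase starts with `^` and may contain at most one `!`. A client could check that rule before sending a request. The sketch below is one possible reading of the rule: `is_valid_yomigana_pitch` is a hypothetical helper, not part of the API or of google-api-python-client, and it checks marker placement only, not whether the characters are Kanji, Hiragana, or Katakana.

```python
def is_valid_yomigana_pitch(pronunciation: str) -> bool:
    """Check the `^` / `!` marker rule quoted in the enum description.

    Hypothetical helper. Assumptions: pitch markers are optional, and a `!`
    that appears before any `^` is treated as invalid.
    """
    if not pronunciation:
        return False
    # Everything before the first `^` lies outside any pitch phrase.
    head, *phrases = pronunciation.split("^")
    if "!" in head:
        return False
    for phrase in phrases:
        # An empty phrase means `^^` or a trailing `^`.
        if not phrase:
            return False
        # Tokyo dialect: at most one down-pitch per pitch phrase.
        if phrase.count("!") > 1:
            return False
    return True


# The three examples from the enum description.
examples = ["^はし", "^は!し", "^はし!"]
results = [is_valid_yomigana_pitch(p) for p in examples]
```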
@@ -608,6 +612,10 @@
 "$ref": "CustomPronunciations",
 "description": "Optional. The pronunciation customizations are applied to the input. If this is set, the input is synthesized using the given pronunciation customizations. The initial support is for en-us, with plans to expand to other locales in the future. Instant Clone voices aren't supported. In order to customize the pronunciation of a phrase, there must be an exact match of the phrase in the input types. If using SSML, the phrase must not be inside a phoneme tag."
 },
+"markup": {
+"description": "Markup for HD voices specifically. This field may not be used with any other voices.",
+"type": "string"
+},
 "multiSpeakerMarkup": {
 "$ref": "MultiSpeakerMarkup",
 "description": "The multi-speaker input to be synthesized. Only applicable for multi-speaker synthesis."
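
The new `markup` property is a plain string on `SynthesisInput`. A request body that uses it might be assembled as in the sketch below. `build_synthesize_body` is a hypothetical helper; the voice name, the `MP3` encoding choice, and the check that exactly one input field is set are assumptions for illustration, and the markup string is ordinary text because the diff does not describe the markup syntax.

```python
# Input fields this sketch treats as mutually exclusive (an assumption).
# `markup` and `multiSpeakerMarkup` appear in the diff; `text` and `ssml`
# are other fields of the same schema.
_INPUT_FIELDS = ("text", "ssml", "markup", "multiSpeakerMarkup")


def build_synthesize_body(voice_name, language_code, **input_fields):
    """Build a request body dict for a synthesize call (hypothetical helper)."""
    unknown = set(input_fields) - set(_INPUT_FIELDS)
    if unknown:
        raise ValueError(f"unknown input field(s): {sorted(unknown)}")
    if len(input_fields) != 1:
        raise ValueError("set exactly one input field")
    return {
        "input": dict(input_fields),
        "voice": {"languageCode": language_code, "name": voice_name},
        "audioConfig": {"audioEncoding": "MP3"},
    }


# The voice name is an assumed example of an HD voice.
body = build_synthesize_body(
    "en-US-Chirp3-HD-Charon", "en-US", markup="Hello from an HD voice."
)

# With google-api-python-client installed and credentials configured,
# the body could then be sent along these lines:
#   from googleapiclient.discovery import build
#   service = build("texttospeech", "v1")
#   response = service.text().synthesize(body=body).execute()
```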

googleapiclient/discovery_cache/documents/texttospeech.v1beta1.json

Lines changed: 11 additions & 3 deletions
@@ -261,7 +261,7 @@
 }
 }
 },
-"revision": "20250415",
+"revision": "20250424",
 "rootUrl": "https://texttospeech.googleapis.com/",
 "schemas": {
 "AdvancedVoiceOptions": {
@@ -342,12 +342,16 @@
 "enum": [
 "PHONETIC_ENCODING_UNSPECIFIED",
 "PHONETIC_ENCODING_IPA",
-"PHONETIC_ENCODING_X_SAMPA"
+"PHONETIC_ENCODING_X_SAMPA",
+"PHONETIC_ENCODING_JAPANESE_YOMIGANA",
+"PHONETIC_ENCODING_PINYIN"
 ],
 "enumDescriptions": [
 "Not specified.",
 "IPA, such as apple -> \u02c8\u00e6p\u0259l. https://en.wikipedia.org/wiki/International_Phonetic_Alphabet",
-"X-SAMPA, such as apple -> \"{p@l\". https://en.wikipedia.org/wiki/X-SAMPA"
+"X-SAMPA, such as apple -> \"{p@l\". https://en.wikipedia.org/wiki/X-SAMPA",
+"For reading-to-pron conversion to work well, the `pronunciation` field should only contain Kanji, Hiragana, and Katakana. The pronunciation can also contain pitch accents. The start of a pitch phrase is specified with `^` and the down-pitch position is specified with `!`, for example: phrase:\u7aef pronunciation:^\u306f\u3057 phrase:\u7bb8 pronunciation:^\u306f!\u3057 phrase:\u6a4b pronunciation:^\u306f\u3057! We currently only support the Tokyo dialect, which allows at most one down-pitch per phrase (i.e. at most one `!` between `^`).",
+"Used to specify pronunciations for Mandarin words. See https://en.wikipedia.org/wiki/Pinyin. For example: \u671d\u9633, the pronunciation is \"chao2 yang2\". The number represents the tone, and there is a space between syllables. Neutral tones are represented by 5, for example \u5b69\u5b50 \"hai2 zi5\"."
 ],
 "type": "string"
 },
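
The `PHONETIC_ENCODING_PINYIN` description specifies space-separated syllables, each ending in a tone digit, with 5 for the neutral tone. The sketch below parses that format and builds one custom pronunciation entry. `parse_pinyin` and `pinyin_entry` are hypothetical helpers; restricting syllables to lowercase `a-z` plus `ü`, tones to 1 through 5, and the `phrase` / `phoneticEncoding` / `pronunciation` field names are assumptions that should be checked against the API reference.

```python
import re

# One syllable: letters followed by a single tone digit (assumed 1 to 5).
_SYLLABLE = re.compile(r"^([a-zü]+)([1-5])$")


def parse_pinyin(pronunciation: str):
    """Split e.g. 'chao2 yang2' into (syllable, tone) pairs.

    Hypothetical helper. Raises ValueError on anything that does not look
    like space-separated syllables with a trailing tone digit.
    """
    parsed = []
    for token in pronunciation.split(" "):
        match = _SYLLABLE.match(token)
        if match is None:
            raise ValueError(f"not a tone-numbered syllable: {token!r}")
        parsed.append((match.group(1), int(match.group(2))))
    return parsed


def pinyin_entry(phrase: str, pronunciation: str) -> dict:
    """Build one custom pronunciation entry after validating the pinyin."""
    parse_pinyin(pronunciation)  # raises ValueError if malformed
    return {
        "phrase": phrase,
        "phoneticEncoding": "PHONETIC_ENCODING_PINYIN",
        "pronunciation": pronunciation,
    }


# The two examples from the enum description.
first = parse_pinyin("chao2 yang2")
second = parse_pinyin("hai2 zi5")
entry = pinyin_entry("朝阳", "chao2 yang2")
```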
@@ -541,6 +545,10 @@
 "$ref": "CustomPronunciations",
 "description": "Optional. The pronunciation customizations are applied to the input. If this is set, the input is synthesized using the given pronunciation customizations. The initial support is for en-us, with plans to expand to other locales in the future. Instant Clone voices aren't supported. In order to customize the pronunciation of a phrase, there must be an exact match of the phrase in the input types. If using SSML, the phrase must not be inside a phoneme tag."
 },
+"markup": {
+"description": "Markup for HD voices specifically. This field may not be used with any other voices.",
+"type": "string"
+},
 "multiSpeakerMarkup": {
 "$ref": "MultiSpeakerMarkup",
 "description": "The multi-speaker input to be synthesized. Only applicable for multi-speaker synthesis."
