UTF-8 Validation
A character in UTF-8 can be from 1 to 4 bytes long, subject to the following rules:
For a 1-byte character, the first bit is 0, followed by its Unicode code.
For an n-byte character, the first n bits are all ones, the (n+1)-th bit is 0, followed by n-1 bytes whose 2 most significant bits are 10.
This is how the UTF-8 encoding works:

| Number of bytes | UTF-8 octet sequence (binary) |
| --- | --- |
| 1 | 0xxxxxxx |
| 2 | 110xxxxx 10xxxxxx |
| 3 | 1110xxxx 10xxxxxx 10xxxxxx |
| 4 | 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx |
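As a sketch of how these rules translate into bit checks (the helper name `char_length` is ours, for illustration only, not part of the problem):

```python
def char_length(byte: int) -> int:
    """How many bytes long is the character whose leading byte is `byte`?
    Returns 1-4 for a valid leading byte, or -1 otherwise.
    (Hypothetical helper for illustration.)"""
    if byte >> 7 == 0b0:      # 0xxxxxxx: 1-byte character
        return 1
    if byte >> 5 == 0b110:    # 110xxxxx: 2-byte character
        return 2
    if byte >> 4 == 0b1110:   # 1110xxxx: 3-byte character
        return 3
    if byte >> 3 == 0b11110:  # 11110xxx: 4-byte character
        return 4
    return -1                 # continuation byte (10xxxxxx) or invalid
```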
Given an array of integers representing the data, return whether it is a valid UTF-8 encoding.
Note: The input is an array of integers. Only the least significant 8 bits of each integer are used to store the data. This means each integer represents only 1 byte of data.
Example 1:
data = [197, 130, 1], which represents the octet sequence 11000101 10000010 00000001.
Return true. It is a valid UTF-8 encoding for a 2-byte character followed by a 1-byte character.
Example 2:
data = [235, 140, 4], which represents the octet sequence 11101011 10001100 00000100.
Return false. The first 3 bits are all ones and the 4th bit is 0, so this is a 3-byte character. The next byte is a valid continuation byte starting with 10, but the second continuation byte does not start with 10, so the encoding is invalid.
Solution
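One straightforward approach (a sketch, not necessarily this page's original solution; the function name `valid_utf8` is ours): scan the array left to right, read each leading byte to learn how many continuation bytes must follow, and verify that each of those bytes starts with the bits 10.

```python
def valid_utf8(data: list[int]) -> bool:
    """Return True if `data` is a valid UTF-8 encoding."""
    i = 0
    while i < len(data):
        byte = data[i] & 0xFF              # only the low 8 bits carry data
        # Determine how many continuation bytes the leading byte demands.
        if byte >> 7 == 0b0:               # 0xxxxxxx: 1-byte character
            n = 0
        elif byte >> 5 == 0b110:           # 110xxxxx: 2-byte character
            n = 1
        elif byte >> 4 == 0b1110:          # 1110xxxx: 3-byte character
            n = 2
        elif byte >> 3 == 0b11110:         # 11110xxx: 4-byte character
            n = 3
        else:                              # a stray continuation byte (10xxxxxx)
            return False                   # or an invalid 11111xxx pattern
        if i + n >= len(data):             # character truncated at the end
            return False
        # Every continuation byte must have the form 10xxxxxx.
        for j in range(i + 1, i + n + 1):
            if (data[j] & 0xFF) >> 6 != 0b10:
                return False
        i += n + 1                         # jump to the next leading byte
    return True
```

Checked against the examples above, `valid_utf8([197, 130, 1])` returns True and `valid_utf8([235, 140, 4])` returns False. The scan visits each byte once, so it runs in O(n) time with O(1) extra space.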