@bariew See d4h/php-finediff. The
Sorry if this is a stupid question. I was able to diff text containing for example
If you diff two 4-byte strings (two 2-byte characters each) and only the first of the 4 bytes is the same in both strings, then a byte-wise diff would trip, because the matching part ends in the middle of a character.
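To make that case concrete, here is a minimal sketch in Python rather than PHP, so it shows only the UTF-8 behavior and not anything from php-finediff itself. The helper names `common_prefix_len` and `is_valid_utf8` and the two sample strings are made up for illustration. Both strings are 4 bytes long (two 2-byte characters each) and share only their first byte.

```python
def common_prefix_len(a, b):
    """Length of the shared prefix of two sequences (bytes or str)."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def is_valid_utf8(data):
    """True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Hypothetical sample strings, two 2-byte characters each.
old = "\u00e9\u00e9"  # bytes: C3 A9 C3 A9
new = "\u00e8\u044e"  # bytes: C3 A8 D1 8E

old_bytes = old.encode("utf-8")
new_bytes = new.encode("utf-8")

# Byte-wise comparison: only the lead byte C3 matches, so the "unchanged"
# part of the diff is half of a character.
byte_prefix = old_bytes[:common_prefix_len(old_bytes, new_bytes)]

# Character-wise comparison: the first characters differ, so nothing matches.
char_prefix = old[:common_prefix_len(old, new)]

print(byte_prefix, is_valid_utf8(byte_prefix))
print(repr(char_prefix))
```

The byte-wise prefix is a lone lead byte, which is not valid UTF-8 on its own, while the character-wise comparison simply reports no common prefix.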
I did a test with two strings: @hannesvdvreken, you are right that the diff fails at the character level. If I do a word-level diff, it works. For my application, I'm only using word-level diffs. Can I therefore ignore mb_str? Thanks.
I guess so. Word boundaries are defined by spaces, and a space is not a multibyte character. If two words are the same as multibyte characters, then they are the same as bytes too.
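A rough way to check that reasoning, again sketched in Python and not taken from php-finediff: assuming the text is UTF-8 and the only word delimiter is the ASCII space (0x20), that byte never occurs inside a multibyte sequence, so splitting the raw bytes gives the same words as splitting the characters. The sample sentence is made up for illustration.

```python
# Hypothetical sample text mixing ASCII, accented Latin and Cyrillic words.
text = "na\u00efve caf\u00e9 \u044e\u0436\u043d\u044b\u0439"

# Word split on the raw UTF-8 bytes, as a byte-oriented diff would do it.
byte_words = text.encode("utf-8").split(b" ")

# Word split on characters, as a multibyte-aware diff would do it.
char_words = text.split(" ")

# Every byte-level word is still valid UTF-8 and matches its character-level
# counterpart, so comparing words as bytes gives the same equal/different
# answers as comparing them as characters.
decoded_words = [w.decode("utf-8") for w in byte_words]
print(decoded_words == char_words)
```

This only covers the ASCII space. Delimiters that are themselves multibyte in UTF-8, such as a non-breaking space, fall outside this assumption.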
This is for encoding support of non-English strings.