Ticket #87 (closed defect: fixed)

Opened 2 years ago

Last modified 2 years ago

Support legacy charsets in subtitles

Reported by: astrange
Assigned to: astrange
Priority: normal
Milestone: 1.1
Component: Subtitles
Version:
Severity: normal
Keywords:
Cc:

Description

External subtitle files have no required character encoding; OGM continues this, though Matroska specifies UTF-8. We should support legacy charsets somehow, even though anything that isn't Unicode sucks.

I'll look into using the automatic detection code from http://wakaba.c3.cx/s/apps/unarchiver.html (if the license is compatible; I think it's the Firefox license), but it might fail in some cases for minor languages like Arabic.
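For illustration only, here is a minimal sketch of the simplest strategy that works without a detection library: try strict UTF-8 first, then a list of legacy charsets, then Latin-1 as a last resort. It is not the patch referenced in this ticket. The function name `decode_subtitle_bytes` and the default fallback list are hypothetical, and a real player would more likely pick fallbacks from a detector or the user's locale.

```python
# Hypothetical default; assumed here only for the example.
FALLBACK_CHARSETS = ("cp1252",)


def decode_subtitle_bytes(data, fallbacks=FALLBACK_CHARSETS):
    """Decode raw subtitle bytes and return (text, charset_used).

    Strict UTF-8 is tried first because legacy-encoded text with
    non-ASCII characters is rarely also valid UTF-8.
    """
    try:
        # "utf-8-sig" also strips a leading byte order mark if present.
        return data.decode("utf-8-sig"), "utf-8"
    except UnicodeDecodeError:
        pass

    for charset in fallbacks:
        try:
            return data.decode(charset), charset
        except UnicodeDecodeError:
            continue

    # Latin-1 maps every byte to a code point, so this never fails,
    # although the result may be mojibake.
    return data.decode("latin-1"), "latin-1"
```

A caller might pass a different fallback list per language, for example `decode_subtitle_bytes(raw, fallbacks=("shift_jis",))` for Japanese subtitles. The weakness of this approach is that many single-byte charsets accept almost any byte sequence, so the first fallback tends to win even when it is wrong; that is the gap statistical detection is meant to close.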

Change History

02/09/07 14:54:12 changed by tick

  • milestone set to 1.1.

02/13/07 02:35:58 changed by astrange

(In [335]) Attempt to handle non-UTF8 subtitles, this time in the new infrastructure. Patch by Brian Foley. (refs #87)

Overlap handling should seriously work now.

04/11/07 15:25:20 changed by astrange

(In [406]) Use the (LGPL) Mozilla automatic charset detection library for external subtitles. Unfortunately, it's quite large... References #87

04/15/07 21:13:37 changed by astrange

  • status changed from new to closed.
  • resolution set to fixed.

Probably fixed now.