unicode - Aegisub 手册

The unicode module for Automation 4 Lua contains various helper functions for working with UTF-8 encoded text.

Usage

Import this module with unicode = require 'aegisub.unicode'.

unicode.charwidth

Synopsis: width = unicode.charwidth(instring, index=1)

Returns the number of bytes occupied by the UTF-8 encoded code points starting at position index in instring. The character pointed to is assumed to be a prefix byte (i.e. the first byte of the code points).

The index parameter is optional abd defaults to 1 (one) when left out, meaning the width of the first character in instring will be returned.

unicode.chars

Synopsis: for char in unicode.chars(instring) do ... end

Returns an iterator function for looping over all code points in the given UTF-8 encoded string. For each iteration of the loop, char will contain a string representing the next code point in the string. This string may be more than one byte long.

unicode.len

Synopsis: length = unicode.len(instring)

Determine the length in code points of the given UTF-8 encoded string.

Be aware that this function does not run in constant time, but in linear time (O(N)) proportional to the number of Unicode code points in instring.

unicode.codepoint

Synopsis: val = unicode.codepoint(instring)

Read the first unicode codepoint from instring.

自动化

概述:	自动化脚本管理器 • 运行宏 • 使用导出滤镜 • 示例宏
卡拉OK模版执行器相关:	声明模版 • 模版执行顺序 • 修饰语 • 内联变量($变量) code行和code区 • 代码执行环境
Lua API 相关:	注册 • 字幕对象 • 进度反馈 • 对话框 • 其他APIs
Lua 模块:	karaskel.lua • util • unicode • cleantags.lua • clipboard • re
Karaskel 概念:	Style tables • Dialogue line tables • Syllable tables • 内联特效 • 注音假名