Files
Nim/unidecode.html
github-actions[bot] ef3d9e7b38 Deploy to GitHub pages
2025-12-28 15:55:52 +00:00

145 lines
12 KiB
HTML

<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "https://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<!-- This file is generated by Nim. -->
<html xmlns="https://www.w3.org/1999/xhtml" xml:lang="en" lang="en" data-theme="auto">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1.0">
<title>std/unidecode</title>
<!-- Google fonts -->
<link href='https://fonts.googleapis.com/css?family=Lato:400,600,900' rel='stylesheet' type='text/css'/>
<link href='https://fonts.googleapis.com/css?family=Source+Code+Pro:400,500,600' rel='stylesheet' type='text/css'/>
<!-- Favicon -->
<link rel="shortcut icon" href=""/>
<link rel="icon" type="image/png" sizes="32x32" href="">
<!-- CSS -->
<link rel="stylesheet" type="text/css" href="nimdoc.out.css?v=2.3.1">
<!-- JS -->
<script type="text/javascript" src="dochack.js?v=2.3.1"></script>
</head>
<body>
<div class="document" id="documentId">
<div class="container">
<h1 class="title">std/unidecode</h1>
<div class="row">
<div class="three columns">
<div class="theme-select-wrapper">
<label for="theme-select">Theme:&nbsp;</label>
<select id="theme-select" onchange="setTheme(this.value)">
<option value="auto">🌗 Match OS</option>
<option value="dark">🌑 Dark</option>
<option value="light">🌕 Light</option>
</select>
</div>
<div id="global-links">
<ul class="simple-boot">
<li><a href="manual.html">Manual</a></li>
<li><a href="lib.html">Standard library</a></li>
<li> <a id="indexLink" href="theindex.html">Index</a></li>
<li><a href="compiler/theindex.html">Compiler docs</a></li>
<li><a href="https://nim-lang.github.io/fusion/theindex.html">Fusion docs</a></li>
<li><a href="https://nim-lang.github.io/Nim/">devel</a>, <a href="https://nim-lang.org/documentation.html">stable</a></li>
</ul>
</div>
<div id="searchInputDiv">
Search: <input type="search" id="searchInput"
oninput="search()" />
</div>
<ul class="simple simple-toc" id="toc-list">
<li>
<a class="reference reference-toplevel" href="#6" id="56">Imports</a>
</li>
<li>
<details open>
<summary><a class="reference reference-toplevel" href="#12" id="62">Procs</a></summary>
<ul class="simple simple-toc-section">
<ul class="simple nested-toc-section">loadUnidecodeTable
<li><a class="reference" href="#loadUnidecodeTable%2Cstring" title="loadUnidecodeTable(datafile = &quot;unidecode.dat&quot;)">loadUnidecodeTable(datafile = &quot;unidecode.dat&quot;)</a></li>
</ul>
<ul class="simple nested-toc-section">unidecode
<li><a class="reference" href="#unidecode%2Cstring" title="unidecode(s: string): string">unidecode(s: string): string</a></li>
</ul>
</ul>
</details>
</li>
</ul>
</div>
<div class="nine columns" id="content">
<a href="https://github.com/nim-lang/Nim/tree/devel/lib/pure/unidecode/unidecode.nim#L1" class="link-seesrc" target="_blank">Source</a>&nbsp;&nbsp;
<a href="https://github.com/nim-lang/Nim/edit/devel/lib/pure/unidecode/unidecode.nim#L1" class="link-seesrc" target="_blank" >Edit</a>&nbsp;&nbsp;
<div id="tocRoot"></div>
<p class="module-desc"><p>This module is based on Python's <a class="reference external" href="https://pypi.org/project/Unidecode/">Unidecode</a> module by Tomaz Solc, which in turn is based on the <a class="reference external" href="https://metacpan.org/pod/Text::Unidecode">Text::Unidecode</a> Perl module by Sean M. Burke.</p>
<p>It provides a <a class="reference external" href="#unidecode,string">unidecode proc</a> that does Unicode to ASCII transliterations: It finds the sequence of ASCII characters that is the closest approximation to the Unicode string.</p>
<p>For example, the closest to string &quot;Äußerst&quot; in ASCII is &quot;Ausserst&quot;. Some information is lost in this transformation, of course, since several Unicode strings can be transformed to the same ASCII representation. So this is a strictly one-way transformation. However, a human reader will probably still be able to guess from the context, what the original string was.</p>
<p>This module needs the data file <tt class="docutils literal"><span class="pre"><span class="Identifier">unidecode</span><span class="Operator">.</span><span class="Identifier">dat</span></span></tt> to work: This file is embedded as a resource into your application by default. You can also define the symbol <tt class="docutils literal"><span class="pre"><span class="Operator">--</span><span class="Identifier">define</span><span class="Punctuation">:</span><span class="Identifier">noUnidecodeTable</span></span></tt> during compile time and use the <a class="reference external" href="#loadUnidecodeTable">loadUnidecodeTable proc</a> to initialize this module.</p>
</p>
<div class="section" id="6">
<h1><a class="toc-backref" href="#6">Imports</a></h1>
<dl class="item">
<a class="reference external" href="unicode.html">unicode</a>, <a class="reference external" href="strutils.html">strutils</a>
</dl>
</div>
<div class="section" id="12">
<h1><a class="toc-backref" href="#12">Procs</a></h1>
<dl class="item">
<div id="loadUnidecodeTable-procs-all">
<div id="loadUnidecodeTable,string">
<dt><pre><span class="Keyword">proc</span> <a href="#loadUnidecodeTable%2Cstring"><span class="Identifier">loadUnidecodeTable</span></a><span class="Other">(</span><span class="Identifier">datafile</span> <span class="Other">=</span> <span class="StringLit">&quot;unidecode.dat&quot;</span><span class="Other">)</span> {.<span><span class="Other pragmadots">...</span></span><span class="pragmawrap"><span class="Identifier">raises</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span> <span class="Identifier">tags</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span>
<span class="Identifier">forbids</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span></span>.}</pre></dt>
<dd>
Loads the datafile that <a class="reference external" href="#unidecode,string">unidecode</a> needs to work. This is only required if the module was compiled with the <tt class="docutils literal"><span class="pre"><span class="Operator">--</span><span class="Identifier">define</span><span class="Punctuation">:</span><span class="Identifier">noUnidecodeTable</span></span></tt> switch. This needs to be called by the main thread before any thread can make a call to <tt class="docutils literal"><span class="pre"><span class="Identifier">unidecode</span></span></tt>.
<a href="https://github.com/nim-lang/Nim/tree/devel/lib/pure/unidecode/unidecode.nim#L41" class="link-seesrc" target="_blank">Source</a>&nbsp;&nbsp;
<a href="https://github.com/nim-lang/Nim/edit/devel/lib/pure/unidecode/unidecode.nim#L41" class="link-seesrc" target="_blank" >Edit</a>&nbsp;&nbsp;
</dd>
</div>
</div>
<div id="unidecode-procs-all">
<div id="unidecode,string">
<dt><pre><span class="Keyword">proc</span> <a href="#unidecode%2Cstring"><span class="Identifier">unidecode</span></a><span class="Other">(</span><span class="Identifier">s</span><span class="Other">:</span> <a href="system.html#string"><span class="Identifier">string</span></a><span class="Other">)</span><span class="Other">:</span> <a href="system.html#string"><span class="Identifier">string</span></a> {.<span><span class="Other pragmadots">...</span></span><span class="pragmawrap"><span class="Identifier">raises</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span> <span class="Identifier">tags</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span><span class="Other">,</span> <span class="Identifier">forbids</span><span class="Other">:</span> <span class="Other">[</span><span class="Other">]</span></span>.}</pre></dt>
<dd>
Finds the sequence of ASCII characters that is the closest approximation to the UTF-8 string <tt class="docutils literal"><span class="pre"><span class="Identifier">s</span></span></tt>.
<p><strong class="examples_text">Example:</strong></p>
<pre class="listing"><span class="Identifier">doAssert</span> <span class="Identifier">unidecode</span><span class="Punctuation">(</span><span class="StringLit">&quot;北京&quot;</span><span class="Punctuation">)</span> <span class="Operator">==</span> <span class="StringLit">&quot;Bei Jing &quot;</span>
<span class="Identifier">doAssert</span> <span class="Identifier">unidecode</span><span class="Punctuation">(</span><span class="StringLit">&quot;Äußerst&quot;</span><span class="Punctuation">)</span> <span class="Operator">==</span> <span class="StringLit">&quot;Ausserst&quot;</span></pre>
<a href="https://github.com/nim-lang/Nim/tree/devel/lib/pure/unidecode/unidecode.nim#L53" class="link-seesrc" target="_blank">Source</a>&nbsp;&nbsp;
<a href="https://github.com/nim-lang/Nim/edit/devel/lib/pure/unidecode/unidecode.nim#L53" class="link-seesrc" target="_blank" >Edit</a>&nbsp;&nbsp;
</dd>
</div>
</div>
</dl>
</div>
</div>
</div>
<div class="twelve-columns footer">
<span class="nim-sprite"></span>
<br>
<small style="color: var(--hint);">Made with Nim. Generated: 2025-12-28 15:55:11 UTC</small>
</div>
</div>
</div>
<script defer data-domain="nim-lang.org" src="https://plausible.io/js/plausible.js"></script>
</body>
</html>