大四老屁股的一些小技巧

在似乎完成了很多但又似乎沒達成什麼的感覺中,大學生活逐漸步入尾聲。回顧這幾年學到的東西,沒幾個可以拿出來說嘴,更別說是對每個人都有用的能力或技巧。不過有些小技巧似乎過於「微不足道」,反而沒人意識到這些技巧也是花時間摸索出來的。

Read more

Customizing Hugo / Blogdown RSS Templates

Blogdown makes it easy to create Hugo blogs or personal websites, and it is becoming more and more popular in the R community. Once the blog is created, people might want to submit their blogs’ RSS feeds to R-bloggers. But before that can happen, one must modify the RSS template to meet the requirements of RSS submission.

Read more

Rendering IPA Symbols in R Markdown

I was thinking about promoting reproducible research in Linguistics, or more precisely, how to attract people with no programming skills to have incentives to learn at least a bit programming, so that they have the ability to make their research more reproducible.

Read more

jieba 自訂詞庫斷詞

在進行中文 Text Mining 前處理時,必須先經過斷詞處理。社群當中存在相當好的斷詞處理工具,如 jieba。但斷詞時常遇到一個問題:文本中重要的詞彙因為不常見於其它地方而被斷開,像是人物角色名稱。要處理這個問題,需將自訂詞庫提供給斷詞套件,才不會將重要詞彙斷開。

Read more