讀書寫作: 7月 2019

7/24/2019

識破萬臉偽裝靠女神卡卡濃妝照，臺大打敗美中俄奪全球第一

余至浩，發表識破萬臉偽裝靠女神卡卡濃妝照，臺大打敗美中俄奪全球第一，iThome，2019-07-05

訓練設計上，分兩階段來完成，第一階段，先以其他資料集抽取人臉特徵，先訓練出一個深度卷積神經網路（DCNN）的模型，可以用來辨識人臉；接下來，再以大會提供的偽裝人臉資料，額外建立一個神經網路區塊，專門用於學習偽裝人臉的辨識。徐宏民進一步補充，設計網路時，不只使用深度學習技術，在進行特徵擷取時，也結合了早期的機器學習的 PCA（主成分分析）技術，將這些人臉資料先經轉換投射到一個新的特徵空間，來學習這些偽臉的主要特徵，再用學到的這些人臉結構、特徵去比對，找出眼前這個人是本人偽裝，還是別人假扮。經過他們訓練過的偽臉演算模型，連化濃妝後的 Lady Gaga 都能認。

對於這次比賽成果，徐宏民表示，這更證明了臺灣人臉辨識研究實力，完全不輸國外頂尖大學，「甚至後來，遇到主辦者，連他都很好奇我們怎麼做，可以做到這麼高的辨識率。」他自豪地說。

AI 決勝關鍵在於晶片

工業技術與資訊月刊，AI 決勝關鍵在於晶片，2019-05-20

AI運作約略可粗分為兩階段，分別是「學習」和「推論」，前者透過機器學習技術，利用大量樣本數據對演算法進行訓練；後者則執行演算法，在終端應用解讀現實的數據。

觀諸國際大廠布局，雲端運算使用的CPU、GPU晶片已被國際大廠把持。吳志毅認為，台灣不一定要搶大廠擅長的高效能運算晶片設計市場，加上國際大廠未來也會借重台積電先進製程，突顯台積電在這場AI戰爭中的重要地位。此外，Google資料中心也採用不少台灣廠商元件和產品，台灣亦具發展優勢。

吳志毅指出，台灣若要切入AI產業，潛在機會在於邊緣運算。隨著AI技術日趨演進，AI由雲端走向裝置端已成必然趨勢，裝置端AI的主要關鍵在於擁有高效能的AI晶片。「台灣在晶片、終端設備與系統具有優勢，也擁有高度的靈活度和彈性，如能配合軟體產業，就有很大的發展空間。」...

Solving the Rubik's Cube with Approximate Policy Iteration

Stephen McAleer, Forest Agostinelli, Alexander Shmakov, Pierre Baldi, Solving the Rubik's Cube with Approximate Policy Iteration, ICLR, 2019.

Recently, Approximate Policy Iteration (API) algorithms have achieved superhuman proficiency in two-player zero-sum games such as Go, Chess, and Shogi without human data. These API algorithms iterate between two policies: a slow policy (tree search), and a fast policy (a neural network). In these two-player games, a reward is always received at the end of the game. However, the Rubik’s Cube has only a single solved state, and episodes are not guaranteed to terminate. This poses a major problem for these API algorithms since they rely on the reward received at the end of the game. We introduce Autodidactic Iteration: an API algorithm that overcomes the problem of sparse rewards by training on a distribution of states that allows the reward to propagate from the goal state to states farther away. Autodidactic Iteration is able to learn how to solve the Rubik’s Cube without relying on human data. Our algorithm is able to solve 100% of randomly scrambled cubes while achieving a median solve length of 30 moves — less than or equal to solvers that employ human domain knowledge.

Forest Agostinelli, Stephen McAleer, Alexander Shmakov, Pierre Baldi. Solving the Rubik’s cube with deep reinforcement learning and search. Nature Machine Intelligence, 2019; DOI: 10.1038/s42256-019-0070-z (Data availability)

Airlines are finally fixing the middle seat

Mark Wilson, Airlines are finally fixing the middle seat, Fast Company, 07.18.19

Spotify是如何推薦新歌的？

極客公園，越聽越上癮，Spotify是如何推薦新歌的？，2019.07.18

音樂推薦在Spotify中並不是一個具象的功能，它被融入到了很多更細微的模組中。首先我們可以從時間緯度出發。Made For You是Spotify推薦歌單的一個集合，每個人的都不一樣，它會以每日、每週甚至是每年的頻率為你推薦音樂

Cold start

最後，如果你剛剛接觸Spotify，歌單/音樂庫應該還不夠豐富，這裡分享一個快速填充的方法：首先找到一首你最喜歡的音樂，右鍵選擇進入Song Radio，Spotify會基於這首歌推薦一個包含50首音樂的歌單，你可以選擇將這份歌單保存到自己的音樂庫中；對這份歌單進行篩選，將喜歡的音樂保存到音樂庫（紅心），同時繼續右鍵進入Song Radio，重複多次你的歌單/音樂庫就會豐富很多，Spotify也會更加瞭解你的音樂口味。

Mapping roads through deep learning and weakly supervised training

Saikat Basu, Derrick Bonafilia, James Gill, Danil Kirsanov, and David Yang, Mapping roads through deep learning and weakly supervised training, Facebook, July 23, 2019.

We collected our training data as a set of 2,048-by-2,048-pixel tiles, with a resolution of approximately 24 inches per pixel. We discarded tiles where fewer than 25 roads had been mapped, because we found that they often included only major roads (with no examples of smaller roads that would be more challenging to label correctly). For each remaining tile, we rasterized the road vectors and used the resulting mask as our training label. To work at the same resolution as the DeepGlobe data set, we randomly cropped each image to 1,024 by 1,024 pixels, thereby producing roughly 1.8 million tiles covering more than 700,000 square miles of terrain. The result was 1,000x more than the roughly 630 square miles that the DeepGlobe data set covered. To create segmentation masks from these road vectors, we simply rasterized each road vector to five pixels. Semantic segmentation labels tend to be pixel-perfect, but the labels we create with this heuristic are not. Roads vary in width and contour in ways that these rasterized vectors could not capture perfectly. Furthermore, roads in different regions around the globe are mapped from different satellite imagery sources and thus do not always align completely with the imagery we use for our training data.

損益平衡點

聊天得知同學畢業後想開餐飲店，問他會不算 (剛修過的) 生管的損益平衡點，引發我出了一題作業。有次和在外工作多年的朋友聊天，他想開咖啡店，我用此方法算出的損益平衡點和朋友相近。

更詳盡的分析，請參見
果漾水豬，分享一下開了兩年咖啡店的心得有心想開咖啡店的進來

7/06/2019

日本限令對韓國半導體業的影響

陳達誠編譯，日本為何能掌握韓國半導體命脈？，鉅亨網，2019/07/03

日本政府在出口到韓國的半導體相關先進材料將進行管制，自 7 月 4 日起將取消韓國的最惠國待遇，在氟化聚醯亞胺、光阻劑、高純度氟化氫，這三種 OLED 面板及半導體生產上不可或缺的原料出口，將從原先的免申請出口許可，改為逐案審核。而在相關的審查作業上，最長將花費 90 個工作天。

此消息一出，引起韓國一片嘩然。根據韓國媒體報導，該國半導體產業對日本的依存度極高，在製造機台方面，韓國國產品比例只有 2 成不到。在剩餘的 8 成左右，則都仰賴自日本、美國，以及荷蘭等國進口。

韓國是全球最大的半導體生產國，然而生產原料及設備都極度依賴進口。韓國的半導體生產設備自製率只有 18.2%，而在原料方面則達到 50.3%(2017 年時)。而在半導體的相關原料上，有近 50% 依賴自日本進口。

7/02/2019

LIS 線上教學平台

LIS，聲音的傳遞-波以耳，2018年8月28日 (more)

一塊錢電池充手機

林秀姿，戶頭剩1千元他堅持理念要做一輩子的教育，聯合報，2018/12/18

嚴天浩大學考進成大化學系 ...

「我曾經連續吃了3個月的泡麵才有今天，」陳爸從LIS起步時就大力支持，是整個團隊的大恩人。嚴天浩聽了恩人的話後恍然大悟：「有道理，頂多吃泡麵而已，假如我們相信這個價值，假如這幾個月，我們幾個人可以影響很多很多的孩子，那吃泡麵也值得。」

嚴天浩常想，假如求學階段，有人提供這樣的學習資源給他，他的學習過程會比較快樂，因為這股信念，他們拒絕補習班買斷，仍想把影片提供給學習弱勢的孩子，「很多人會說現在的社會，反映出的是20年前的教育，因為當初的學生，長大後就是我們社會現在的樣子。我們想改變的，就是20年後的社會。」

科學教材集資計畫，請大家幫忙。

7/01/2019

AIQ 的時代

何玉方譯，AIQ：不管你願不願意，現在已是AIQ比IQ、EQ更重要的時代，商業周刊，2019
Nick Polson and James Scott, AIQ: How People and Machines Are Smarter Together, St. Martin's Press, 2018.

1. AI關鍵發展史上，7個人類智慧影響人工智慧的故事
2. 解讀促進AI發展的4大元素
3. 機器智慧（machine intelligence）新解！借助人工智慧之力「放大」人類智慧

透過歷史人物故事和基礎數學，說明人工智慧的現況與應用。適合當成大學通識課程的課本，或是高中數學的課外讀物。

訂閱：文章 (Atom)

讀書寫作