TimeSeries

Time Series Overview

Posted on November 21, 2022 by ตั้ม

Time Series Data คือ ข้อมูลที่ประกอบด้วยข้อมูลย่อยหลายข้อมูล (หลายตัวอย่าง) แต่ละข้อมูลย่อยกำกับด้วยวันเวลา โดยข้อมูลทั้งหมดเรียงลำดับตามวันเวลา ตัวอย่างเช่น ข้อมูลราคาหุ้น, ข้อมูลพยากรณ์อากาศ, ข้อมูลจากเซ็นเซอร์, ข้อมูล log, ข้อมูลทรานแซกชั่น ประเภทข้อมูล time series มักแบ่งตามคุณลักษณะ ซึ่งคุณลักษณะที่เกี่ยวกับการลงทุนที่มักมีการกล่าวถึงบ่อยครั้งคือ ข้อมูลประเภท stationary กับ non-stationary การทำนายข้อมูล time series ที่ให้ความแม่นยำสูงมักเป็นประเภท stationary ซึ่งหมายถึงข้อมูลที่มีวัฏจักรสม่ำเสมอ เช่น ฤดูกาล, การออกดอก/ผลตามฤดูกาลของพืชผลทางการเกษตร, การจ่ายปันผลประจำปี, ค่าน้ำ/ค่าไฟ ข้อมูลลักษณะแบบนี้มักคาดการณ์ได้ง่าย เช่น หากเดือนที่แล้วมีค่าไฟ 1,000 บาท หากเดือนนี้ใช้ไฟใกล้เคียงกับเดือนที่แล้ว เดือนนี้ก็อาจมีค่าไฟใกล้เคียง 1,000 บาท…

Category: Level 1 - Foundation Tags: TimeSeries

Time series feature engineering

Posted on September 7, 2021 by ตั้ม

Leave a Comment

search อากู๋คำว่า ‘time series feature engineering for stock market’ เจอผลลัพธ์เจ๋งๆ มากมาย (การทำโมเดล จะ รวย/จน, แม่น/ไม่แม่น, เวิร์ก/ไม่เวิร์ก สำคัญที่สุดอยู่ที่ feature engineering และ model architecture นะครับ) ผลลัพธ์เจ๋งๆ เช่น https://towardsdatascience.com/the-importance-of-feature-engineering-for-financial-time-series-forecasting-a1163efe8b8a https://www.analyticsvidhya.com/blog/2019/12/6-powerful-feature-engineering-techniques-time-series/ https://towardsdatascience.com/top-4-time-series-feature-engineering-lessons-from-kaggle-ca2d4c9cbbe7 https://www.kdnuggets.com/2017/11/automated-feature-engineering-time-series-data.html https://alphascientist.com/feature_engineering.html https://mdpi-res.com/d_attachment/entropy/entropy-22-01162/article_deploy/entropy-22-01162-v3.pdf ตัวอย่าง feature engineering แนว domain specific เช่นMACD กำหนดให้มี 4 ค่า1 = macd >=…

Category: articles Tags: FeatureEngineering, level3, Model, TimeSeries

เหตุผลที่เลือกใช้ InfluxDB เก็บราคาและ time series data

Posted on August 15, 2021 by minimallife

Leave a Comment

เราใช้ influxdb (https://www.influxdata.com/products/influxdb-overview/) ในระบบเรา (รันอยู่ใน docker) ในการเก็บข้อมูลประเภท time series data เช่น ราคา, trade action, trade statistics, trade position ฯ

เครดิตรูปภาพจาก https://medium.com/analytics-vidhya/watch-your-stock-shares-with-grafana-and-influxdb-4df7a99c6d64

ส่วนเหตุผล (design rationale) ที่ผมเลือกใช้ InfluxDB คือ

เป็น time series database อันดับต้นๆ เลย
doc ดีมาก มีตัวอย่างเยอะ มี REST API
คนใช้เยอะ benchmark ดี
ผมถามเพื่อนๆ น้องๆ ที่รู้จัก ที่เค้าใช้ในสเกลใหญ่ๆ เค้าแนะนำ influxdb
มันขึ้นชื่อเรื่องเร็วมาก
คำสั่งเป็น SQL คล้าย SQL ใน RDBMS ทำให้เรียนรู้ไม่ยาก
รองรับฟังก์ชั่น, การคำนวณ และการทำงานแบบ time series เช่น การ aggregate ตามเวลา, insert or update โดยเช็กจาก time ซึ่งอันนี้สะดวกมาก เหมาะกับงานเราสุดๆ เวลามีราคาใหม่ฟีดเข้ามา แต่แท่งยังไม่จบก็ update ทับ แต่ถ้าขึ้นแท่งราคาใหม่ก็ insert
รูปแบบการตั้ง index (tag) ก็เหมาะกับงานเราดี
มี official python library และอัพเดตบ่อยใช้ได้

Category: 3. Data Feed & Basic Market Access, Level 5 - IT & architecture Tags: InfluxDB, level5, TimeSeries, TimeSeriesData