{"id":84096,"date":"2025-01-17T17:51:22","date_gmt":"2025-01-17T10:51:22","guid":{"rendered":"https:\/\/itviec.com\/blog\/?p=84096"},"modified":"2026-04-06T10:20:39","modified_gmt":"2026-04-06T03:20:39","slug":"cau-hoi-phong-van-data-analyst","status":"publish","type":"post","link":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/","title":{"rendered":"Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p"},"content":{"rendered":"<div id=\"ez-toc-container\" class=\"ez-toc-v2_0_85 counter-hierarchy ez-toc-counter ez-toc-grey ez-toc-container-direction\">\n<div class=\"ez-toc-title-container\">\n<p class=\"ez-toc-title\" style=\"cursor:inherit\">N\u1ed9i dung b\u00e0i vi\u1ebft<\/p>\n<span class=\"ez-toc-title-toggle\"><a href=\"#\" class=\"ez-toc-pull-right ez-toc-btn ez-toc-btn-xs ez-toc-btn-default ez-toc-toggle\" aria-label=\"Toggle Table of Content\"><span class=\"ez-toc-js-icon-con\"><span class=\"\"><span class=\"eztoc-hide\" style=\"display:none;\">Toggle<\/span><span class=\"ez-toc-icon-toggle-span\"><svg style=\"fill: #999;color:#999\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" class=\"list-377408\" width=\"20px\" height=\"20px\" viewBox=\"0 0 24 24\" fill=\"none\"><path d=\"M6 6H4v2h2V6zm14 0H8v2h12V6zM4 11h2v2H4v-2zm16 0H8v2h12v-2zM4 16h2v2H4v-2zm16 0H8v2h12v-2z\" fill=\"currentColor\"><\/path><\/svg><svg style=\"fill: #999;color:#999\" class=\"arrow-unsorted-368013\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" width=\"10px\" height=\"10px\" viewBox=\"0 0 24 24\" version=\"1.2\" baseProfile=\"tiny\"><path d=\"M18.2 9.3l-6.2-6.3-6.2 6.3c-.2.2-.3.4-.3.7s.1.5.3.7c.2.2.4.3.7.3h11c.3 0 .5-.1.7-.3.2-.2.3-.5.3-.7s-.1-.5-.3-.7zM5.8 14.7l6.2 6.3 6.2-6.3c.2-.2.3-.5.3-.7s-.1-.5-.3-.7c-.2-.2-.4-.3-.7-.3h-11c-.3 0-.5.1-.7.3-.2.2-.3.5-.3.7s.1.5.3.7z\"\/><\/svg><\/span><\/span><\/span><\/a><\/span><\/div>\n<nav><ul class='ez-toc-list ez-toc-list-level-1 ' ><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-1\" href=\"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#Data_Analyst_can_nhung_ky_nang_nao\" >Data Analyst c\u1ea7n nh\u1eefng k\u1ef9 n\u0103ng n\u00e0o?<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-2\" href=\"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#Cac_cau_hoi_phong_van_Data_Analyst_ve_ky_nang_chuyen_mon_Technical_skill\" >C\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst v\u1ec1 k\u1ef9 n\u0103ng chuy\u00ean m\u00f4n (Technical skill)<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-3\" href=\"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#Cac_cau_hoi_phong_van_Data_Analyst_ve_ky_nang_mem_va_phan_tich_kinh_doanh\" >C\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst v\u1ec1 k\u1ef9 n\u0103ng m\u1ec1m v\u00e0 ph\u00e2n t\u00edch kinh doanh<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-4\" href=\"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#Cac_cau_hoi_phong_van_Data_Analyst_ve_Quan_ly_cong_viec_Quy_trinh_Truc_quan_hoa_du_lieu\" >C\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst v\u1ec1 Qu\u1ea3n l\u00fd c\u00f4ng vi\u1ec7c, Quy tr\u00ecnh &amp; Tr\u1ef1c quan ho\u00e1 d\u1eef li\u1ec7u<\/a><\/li><li class='ez-toc-page-1 ez-toc-heading-level-2'><a class=\"ez-toc-link ez-toc-heading-5\" href=\"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#Tong_ket\" >T\u1ed5ng k\u1ebft<\/a><\/li><\/ul><\/nav><\/div>\n<p><em><strong> Theo b\u00e1o c\u00e1o \u201c<a href=\"https:\/\/business.linkedin.com\/talent-solutions\/emerging-jobs-report\" target=\"_blank\" rel=\"noopener\">Jobs on the Rise<\/a>\u201d c\u1ee7a LinkedIn, Data Analyst n\u1eb1m trong Top 10 c\u00f4ng vi\u1ec7c t\u0103ng tr\u01b0\u1edfng m\u1ea1nh nh\u1ea5t to\u00e0n c\u1ea7u v\u1edbi m\u1ee9c t\u0103ng 25% m\u1ed7i n\u0103m \u1edf nhi\u1ec1u th\u1ecb tr\u01b0\u1eddng ph\u00e1t tri\u1ec3n. Th\u00eam v\u00e0o \u0111\u00f3, <a href=\"https:\/\/www.weforum.org\/stories\/2019\/04\/how-much-data-is-generated-each-day-cf4bddf29f\/\" target=\"_blank\" rel=\"noopener\">th\u1ebf gi\u1edbi c\u00f3 th\u1ec3 t\u1ea1o ra t\u1edbi 463 exabyte d\u1eef li\u1ec7u m\u1ed7i ng\u00e0y v\u00e0o n\u0103m 2025<\/a> \u2013 d\u1eef li\u1ec7u d\u1ed3i d\u00e0o nh\u01b0ng v\u00f4 h\u00ecnh n\u1ebfu kh\u00f4ng c\u00f3 ng\u01b0\u1eddi \u201cgi\u1ea3i m\u00e3\u201d. Ch\u00ednh v\u00ec th\u1ebf, b\u00e0i vi\u1ebft n\u00e0y s\u1ebd kh\u00f4ng ch\u1ec9 d\u1eebng l\u1ea1i \u1edf vi\u1ec7c li\u1ec7t k\u00ea nh\u1eefng c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst m\u00e0 c\u00f2n mong mu\u1ed1n gi\u00fap c\u1ea3 \u1ee9ng vi\u00ean l\u1eabn nh\u00e0 tuy\u1ec3n d\u1ee5ng (ho\u1eb7c l\u1eadp tr\u00ecnh vi\u00ean, qu\u1ea3n l\u00fd) hi\u1ec3u s\u00e2u h\u01a1n v\u1ec1 c\u00e1c k\u1ef9 n\u0103ng \u201cx\u01b0\u01a1ng s\u1ed1ng\u201d c\u1ee7a m\u1ed9t Data Analyst.<\/strong><\/em><\/p>\n<p>\u0110\u1ecdc b\u00e0i vi\u1ebft sau \u0111\u1ec3 n\u1eafm v\u1eefng c\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst thu\u1ed9c c\u00e1c ch\u1ee7 \u0111\u1ec1 t\u1eeb vi\u1ec7c l\u00e0m s\u1ea1ch d\u1eef li\u1ec7u (data wrangling), t\u01b0 duy ph\u00e2n t\u00edch kinh doanh, \u0111\u1ebfn thi\u1ebft k\u1ebf v\u00e0 \u0111o l\u01b0\u1eddng ch\u1ec9 s\u1ed1 (metrics).<\/p>\n<h2><span class=\"ez-toc-section\" id=\"Data_Analyst_can_nhung_ky_nang_nao\"><\/span><b>Data Analyst c\u1ea7n nh\u1eefng k\u1ef9 n\u0103ng n\u00e0o?<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<h3><b>L\u1eadp tr\u00ecnh, l\u00e0m s\u1ea1ch v\u00e0 x\u1eed l\u00fd d\u1eef li\u1ec7u (Python, R, SQL, Excel)<\/b><\/h3>\n<ul>\n<li><b>Ng\u00f4n ng\u1eef l\u1eadp tr\u00ecnh<\/b><span style=\"font-weight: 400;\">: Theo<\/span><span style=\"font-weight: 400;\">\u00a0<a href=\"https:\/\/itviec.com\/bao-cao\/luong-it-va-thi-truong-tuyen-dung-it-vietnam\" target=\"_blank\" rel=\"noopener\"><strong>B\u00e1o c\u00e1o L\u01b0\u01a1ng &amp; Th\u1ecb tr\u01b0\u1eddng Tuy\u1ec3n d\u1ee5ng IT 2025 &#8211; 2026<\/strong><\/a> t\u1eeb ITviec<\/span><span style=\"font-weight: 400;\">, top 5 ng\u00f4n ng\u1eef l\u1eadp tr\u00ecnh ch\u00ednh \u0111\u01b0\u1ee3c Data Analyst s\u1eed d\u1ee5ng \u0111\u1ec3 thao t\u00e1c d\u1eef li\u1ec7u l\u00e0: SQL, Python, HTML\/CSS, SAS.<\/span><\/li>\n<li><b>Excel<\/b><span style=\"font-weight: 400;\">: V\u1eabn l\u00e0 \u201cc\u00f4ng c\u1ee5 qu\u1ed1c d\u00e2n\u201d cho thao t\u00e1c d\u1eef li\u1ec7u nhanh, t\u00ednh to\u00e1n pivot, v\u00e0 \u0111\u1eb7c bi\u1ec7t h\u1eefu \u00edch khi x\u1eed l\u00fd c\u00e1c b\u00e0i to\u00e1n c\u1ee1 v\u1eeba, kh\u00f4ng \u0111\u00f2i h\u1ecfi ki\u1ebfn th\u1ee9c code ph\u1ee9c t\u1ea1p.<\/span><\/li>\n<\/ul>\n<h3><b>Tr\u1ef1c quan ho\u00e1 d\u1eef li\u1ec7u (Tableau, Power BI)<\/b><\/h3>\n<p><span style=\"font-weight: 400;\">Nh\u1eefng c\u00f4ng c\u1ee5 BI (Business Intelligence) nh\u01b0 Tableau, Power BI hay Looker th\u01b0\u1eddng \u0111\u01b0\u1ee3c d\u00f9ng \u0111\u1ec3 \u201ck\u1ec3 c\u00e2u chuy\u1ec7n\u201d b\u1eb1ng h\u00ecnh \u1ea3nh.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Theo <\/span><a href=\"https:\/\/www.salesforce.com\/news\/stories\/gartner-magic-quadrant-analytics-intelligence\/\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">th\u1ed1ng k\u00ea t\u1eeb Salesforce<\/span><\/a><span style=\"font-weight: 400;\">, c\u00e1c c\u00f4ng ty \u00e1p d\u1ee5ng t\u1ed1t BI v\u00e0 Data Visualization c\u00f3 th\u1ec3 c\u1ea3i thi\u1ec7n quy\u1ebft \u0111\u1ecbnh kinh doanh \u0111\u1ebfn 80% nh\u1edd v\u00e0o kh\u1ea3 n\u0103ng quan s\u00e1t realtime dashboard.<\/span><\/p>\n<h3><b>Ki\u1ebfn th\u1ee9c v\u1ec1 Cloud &amp; IDE (BigQuery, AWS Redshift, Databricks, Jupyter Notebook, VSCode)<\/b><\/h3>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Ng\u00e0y c\u00e0ng nhi\u1ec1u doanh nghi\u1ec7p chuy\u1ec3n d\u1ecbch sang h\u1ea1 t\u1ea7ng Cloud \u0111\u1ec3 x\u1eed l\u00fd kh\u1ed1i l\u01b0\u1ee3ng d\u1eef li\u1ec7u kh\u1ed5ng l\u1ed3 (Big Data). C\u00e1c n\u1ec1n t\u1ea3ng Cloud hi\u1ec7n \u0111\u1ea1i \u0111\u00e3 ch\u1ee9ng t\u1ecf kh\u1ea3 n\u0103ng x\u1eed l\u00fd h\u00e0ng tri\u1ec7u truy v\u1ea5n m\u1ed7i ng\u00e0y, \u0111\u00e1p \u1ee9ng nhu c\u1ea7u ng\u00e0y c\u00e0ng cao c\u1ee7a doanh nghi\u1ec7p.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Vi\u1ec7c d\u00f9ng IDE linh ho\u1ea1t nh\u01b0 Jupyter Notebook ho\u1eb7c VSCode gi\u00fap Data Analyst g\u1ee1 r\u1ed1i code nhanh ch\u00f3ng, ghi ch\u00fa \u0111\u01b0\u1ee3c quy tr\u00ecnh l\u00e0m vi\u1ec7c, v\u00e0 d\u1ec5 d\u00e0ng chia s\u1ebb k\u1ebft qu\u1ea3 trong \u0111\u1ed9i.<\/span><\/li>\n<\/ul>\n<h3><b>Thi\u1ebft k\u1ebf Metrics &amp; Ch\u1ec9 s\u1ed1 \u0111o l\u01b0\u1eddng<\/b><\/h3>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>X\u00e1c \u0111\u1ecbnh KPI, metrics ph\u00f9 h\u1ee3p<\/b><span style=\"font-weight: 400;\">: V\u00ed d\u1ee5, v\u1edbi website th\u01b0\u01a1ng m\u1ea1i \u0111i\u1ec7n t\u1eed, KPI c\u00f3 th\u1ec3 l\u00e0 \u201ct\u1ef7 l\u1ec7 chuy\u1ec3n \u0111\u1ed5i\u201d (conversion rate) ho\u1eb7c \u201cgi\u1ecf h\u00e0ng trung b\u00ecnh\u201d (average order value).<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>T\u01b0 duy \u0111o l\u01b0\u1eddng v\u00e0 \u0111\u00e1nh gi\u00e1 hi\u1ec7u qu\u1ea3<\/b><span style=\"font-weight: 400;\">: Bi\u1ebft c\u00e1ch \u0111\u1eb7t m\u1ee5c ti\u00eau h\u1ee3p l\u00fd, tr\u00e1nh r\u01a1i v\u00e0o b\u1eaby \u201cvanity metrics\u201d (ch\u1ec9 s\u1ed1 tr\u00f4ng \u0111\u1eb9p nh\u01b0ng kh\u00f4ng mang l\u1ea1i gi\u00e1 tr\u1ecb th\u1ef1c).<\/span><\/li>\n<\/ul>\n<h3><b>K\u1ef9 n\u0103ng n\u00e2ng cao (Nice to have)<\/b><\/h3>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Data Engineering (ETL, Data Modeling)<\/b><span style=\"font-weight: 400;\">: Th\u00f4ng th\u1ea1o thi\u1ebft k\u1ebf pipeline d\u1eef li\u1ec7u gi\u00fap c\u00f4ng vi\u1ec7c c\u1ee7a Data Analyst tr\u01a1n tru h\u01a1n.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>A\/B Testing, ph\u00e2n t\u00edch th\u1ed1ng k\u00ea n\u00e2ng cao<\/b><span style=\"font-weight: 400;\">: Trong marketing hay s\u1ea3n ph\u1ea9m, A\/B Testing l\u00e0 \u201cx\u01b0\u01a1ng s\u1ed1ng\u201d c\u1ee7a c\u00e1c quy\u1ebft \u0111\u1ecbnh t\u1ed1i \u01b0u tr\u1ea3i nghi\u1ec7m ng\u01b0\u1eddi d\u00f9ng.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Data Science c\u01a1 b\u1ea3n (regression, clustering, time series)<\/b><span style=\"font-weight: 400;\">: \u0110\u01b0a ra d\u1ef1 \u0111o\u00e1n (forecast) v\u00e0 ph\u00e1t hi\u1ec7n xu h\u01b0\u1edbng s\u1edbm, gi\u1ea3m thi\u1ec3u r\u1ee7i ro kinh doanh.<\/span><\/li>\n<\/ul>\n<blockquote><p><em>\u0110\u1ecdc th\u00eam: <a href=\"https:\/\/itviec.com\/blog\/lo-trinh-data-analyst\/\" target=\"_blank\" rel=\"noopener\"><strong>L\u1ed9 tr\u00ecnh ph\u00e1t tri\u1ec3n s\u1ef1 nghi\u1ec7p Data Analyst: V\u1ecb tr\u00ed, C\u00f4ng vi\u1ec7c v\u00e0 Y\u00eau c\u1ea7u<\/strong><\/a><\/em><\/p><\/blockquote>\n<h2><span class=\"ez-toc-section\" id=\"Cac_cau_hoi_phong_van_Data_Analyst_ve_ky_nang_chuyen_mon_Technical_skill\"><\/span><b>C\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst v\u1ec1 k\u1ef9 n\u0103ng chuy\u00ean m\u00f4n (Technical skill)<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<h3><strong>B\u1ea1n th\u01b0\u1eddng l\u00e0m g\u00ec \u0111\u1ec3 l\u00e0m s\u1ea1ch d\u1eef li\u1ec7u trong pandas (Python)?<\/strong><\/h3>\n<p><b>G\u00f3c nh\u00ecn nh\u00e0 tuy\u1ec3n d\u1ee5ng:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Doanh nghi\u1ec7p k\u1ef3 v\u1ecdng \u1ee9ng vi\u00ean n\u1eafm v\u1eefng quy tr\u00ecnh l\u00e0m s\u1ea1ch d\u1eef li\u1ec7u m\u1ed9t c\u00e1ch c\u00f3 h\u1ec7 th\u1ed1ng, kh\u00f4ng ch\u1ec9 bi\u1ebft m\u1ed7i l\u1ec7nh <\/span><span style=\"font-weight: 400;\">dropna()<\/span><span style=\"font-weight: 400;\"> hay <\/span><span style=\"font-weight: 400;\">fillna()<\/span><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">H\u1ecd c\u0169ng mu\u1ed1n bi\u1ebft li\u1ec7u \u1ee9ng vi\u00ean c\u00f3 qu\u1ea3n l\u00fd \u0111\u01b0\u1ee3c vi\u1ec7c ghi nh\u1eadn, gi\u1ea3i th\u00edch l\u00fd do ch\u1ecdn ph\u01b0\u01a1ng ph\u00e1p (mean, median, m\u00f4 h\u00ecnh d\u1ef1 \u0111o\u00e1n\u2026) hay kh\u00f4ng, v\u00ec vi\u1ec7c n\u00e0y s\u1ebd \u1ea3nh h\u01b0\u1edfng tr\u1ef1c ti\u1ebfp \u0111\u1ebfn ch\u1ea5t l\u01b0\u1ee3ng ph\u00e2n t\u00edch.<\/span><\/p>\n<p><b>C\u00e2u tr\u1ea3 l\u1eddi g\u1ee3i \u00fd:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Trong qu\u00e1 tr\u00ecnh l\u00e0m s\u1ea1ch d\u1eef li\u1ec7u v\u1edbi <\/span><span style=\"font-weight: 400;\">pandas<\/span><span style=\"font-weight: 400;\">, t\u00f4i th\u01b0\u1eddng ti\u1ebfn h\u00e0nh theo c\u00e1c b\u01b0\u1edbc:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">Xem nhanh qua d\u1eef li\u1ec7u (head, info, describe) \u0111\u1ec3 n\u1eafm t\u1ed5ng quan.<\/span><\/li>\n<li><span style=\"font-weight: 400;\">T\u00ecm v\u00e0 x\u1eed l\u00fd d\u1eef li\u1ec7u thi\u1ebfu (missing values), ch\u1eb3ng h\u1ea1n nh\u01b0 d\u00f9ng <\/span><span style=\"font-weight: 400;\">mean()<\/span><span style=\"font-weight: 400;\"> ho\u1eb7c <\/span><span style=\"font-weight: 400;\">median()<\/span><span style=\"font-weight: 400;\"> n\u1ebfu d\u1eef li\u1ec7u ph\u00f9 h\u1ee3p, ho\u1eb7c x\u00f3a d\u00f2ng n\u1ebfu t\u1ef7 l\u1ec7 b\u1ecb thi\u1ebfu nh\u1ecf v\u00e0 kh\u00f4ng \u1ea3nh h\u01b0\u1edfng nhi\u1ec1u \u0111\u1ebfn m\u1eabu.<\/span><\/li>\n<li>Chu\u1ea9n h\u00f3a ki\u1ec3u d\u1eef li\u1ec7u (v\u00ed d\u1ee5: ng\u00e0y th\u00e1ng v\u1ec1 datetime) v\u00e0 ki\u1ec3m tra xem c\u00f3 gi\u00e1 tr\u1ecb ngo\u1ea1i lai (outlier) kh\u00f4ng. N\u1ebfu c\u00f3, t\u00f4i s\u1ebd c\u00e2n nh\u1eafc d\u00f9ng IQR (Interquartile Range) ho\u1eb7c z-score \u0111\u1ec3 lo\u1ea1i b\u1ecf ho\u1eb7c \u0111i\u1ec1u ch\u1ec9nh.<\/li>\n<li>Cu\u1ed1i c\u00f9ng, t\u00f4i th\u01b0\u1eddng l\u01b0u l\u1ea1i c\u00e1c b\u01b0\u1edbc n\u00e0y v\u00e0o notebook \u0111\u1ec3 \u0111\u1ea3m b\u1ea3o quy tr\u00ecnh c\u00f3 t\u00ednh t\u00e1i l\u1eadp.<\/li>\n<\/ul>\n<h3><strong>So s\u00e1nh DataFrame trong pandas v\u00e0 data frame trong R?<\/strong><\/h3>\n<p><strong>pandas.DataFrame<\/strong><span style=\"font-weight: 400;\"><strong> trong Python<\/strong> linh ho\u1ea1t v\u1ec1 vi\u1ec7c thao t\u00e1c v\u1edbi d\u1eef li\u1ec7u l\u1edbn, t\u1ed1c \u0111\u1ed9 x\u1eed l\u00fd nh\u1edd th\u01b0 vi\u1ec7n C (NumPy) h\u1ed7 tr\u1ee3. C\u1ed9ng \u0111\u1ed3ng Python c\u0169ng r\u1ea5t m\u1ea1nh, li\u00ean t\u1ee5c c\u1eadp nh\u1eadt nhi\u1ec1u th\u01b0 vi\u1ec7n m\u1edf r\u1ed9ng (nh\u01b0 <\/span><span style=\"font-weight: 400;\">scikit-learn<\/span><span style=\"font-weight: 400;\"> cho machine learning).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Trong khi \u0111\u00f3, <\/span><strong>data frame<\/strong><span style=\"font-weight: 400;\"><strong> c\u1ee7a R<\/strong> \u0111\u01b0\u1ee3c thi\u1ebft k\u1ebf s\u00e1t v\u1edbi c\u00e1c h\u00e0m th\u1ed1ng k\u00ea, d\u1ec5 d\u00e0ng tri\u1ec3n khai c\u00e1c test th\u1ed1ng k\u00ea ho\u1eb7c c\u00e1c k\u1ef9 thu\u1eadt nh\u01b0 ANOVA. Ngo\u00e0i ra, c\u00e1c h\u00e0m x\u1eed l\u00fd d\u1eef li\u1ec7u \u201cti\u1ec7n tay\u201d nh\u01b0 <\/span><span style=\"font-weight: 400;\">dplyr<\/span><span style=\"font-weight: 400;\">, <\/span><span style=\"font-weight: 400;\">tidyr<\/span><span style=\"font-weight: 400;\"> hay <\/span><span style=\"font-weight: 400;\">ggplot2<\/span><span style=\"font-weight: 400;\"> kh\u00e1 m\u1ea1nh v\u00e0 \u201cchu\u1ea9n\u201d cho nghi\u00ean c\u1ee9u th\u1ed1ng k\u00ea.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f9y m\u1ee5c \u0111\u00edch m\u00e0 t\u00f4i ch\u1ecdn Python (khi thi\u00ean v\u1ec1 s\u1ea3n xu\u1ea5t, c\u1ea7n t\u00edch h\u1ee3p h\u1ec7 th\u1ed1ng, build pipeline) hay R (khi thi\u00ean v\u1ec1 ph\u00e2n t\u00edch th\u1ed1ng k\u00ea chuy\u00ean s\u00e2u, tr\u1ef1c quan).<\/span><\/p>\n<h3><strong>B\u1ea1n t\u1eebng d\u00f9ng th\u01b0 vi\u1ec7n n\u00e0o trong R \u0111\u1ec3 tr\u1ef1c quan ho\u00e1 d\u1eef li\u1ec7u (v\u00ed d\u1ee5: ggplot2)?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">T\u00f4i ch\u1ee7 y\u1ebfu d\u00f9ng <\/span><span style=\"font-weight: 400;\">ggplot2<\/span><span style=\"font-weight: 400;\"> v\u00ec c\u00fa ph\u00e1p chia l\u00e0m nhi\u1ec1u l\u1edbp (grammar of graphics) r\u1ea5t r\u00f5 r\u00e0ng: t\u00f4i \u0111\u1ecbnh ngh\u0129a d\u1eef li\u1ec7u, tr\u1ee5c (aes), sau \u0111\u00f3 th\u00eam c\u00e1c layer (geom_bar, geom_line, v.v.).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Kh\u1ea3 n\u0103ng t\u00f9y bi\u1ebfn c\u1ee7a <\/span><span style=\"font-weight: 400;\">ggplot2<\/span><span style=\"font-weight: 400;\"> r\u1ea5t cao, ph\u00f9 h\u1ee3p \u0111\u1ec3 kh\u00e1m ph\u00e1 d\u1eef li\u1ec7u ho\u1eb7c xu\u1ea5t b\u1ea3n c\u00e1c bi\u1ec3u \u0111\u1ed3 c\u00f3 ch\u1ea5t l\u01b0\u1ee3ng \u201cxu\u1ea5t t\u1ea1p ch\u00ed\u201d. V\u1edbi c\u00e1c b\u00e0i to\u00e1n EDA (Exploratory Data Analysis), t\u00f4i th\u00edch k\u1ebft h\u1ee3p <\/span><span style=\"font-weight: 400;\">ggplot2<\/span><span style=\"font-weight: 400;\"> v\u1edbi <\/span><span style=\"font-weight: 400;\">plotly<\/span><span style=\"font-weight: 400;\"> \u0111\u1ec3 t\u1ea1o bi\u1ec3u \u0111\u1ed3 t\u01b0\u01a1ng t\u00e1c, tr\u00ecnh b\u00e0y tr\u01b0\u1edbc stakeholder.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ngo\u00e0i ggplot2, sau \u0111\u00e2y l\u00e0 danh s\u00e1ch c\u00e1c th\u01b0 vi\u1ec7n R h\u1eefu \u00edch<\/span><span style=\"font-weight: 400;\">\u00a0trong vi\u1ec7c tr\u1ef1c quan h\u00f3a d\u1eef li\u1ec7u:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">plotly<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">lattice<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">highcharter<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">shiny<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">cowplot<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">dygraphs<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">heatmaply<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">visNetwork<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">corrplot<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">rgl<\/span><\/li>\n<\/ul>\n<h3><strong>Vi\u1ebft m\u1ed9t truy v\u1ea5n JOIN t\u1eeb 2\u20133 b\u1ea3ng, r\u1ed3i gi\u1ea3i th\u00edch logic v\u00e0 c\u00e1ch x\u1eed l\u00fd k\u1ebft qu\u1ea3.<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">V\u00ed d\u1ee5, t\u00f4i c\u00f3 b\u1ea3ng <\/span><span style=\"font-weight: 400;\">orders<\/span><span style=\"font-weight: 400;\"> (ch\u1ee9a th\u00f4ng tin \u0111\u01a1n h\u00e0ng) v\u00e0 b\u1ea3ng <\/span><span style=\"font-weight: 400;\">customers<\/span><span style=\"font-weight: 400;\"> (ch\u1ee9a th\u00f4ng tin kh\u00e1ch h\u00e0ng). T\u00f4i mu\u1ed1n l\u1ea5y danh s\u00e1ch kh\u00e1ch h\u00e0ng, c\u00f9ng t\u1ed5ng gi\u00e1 tr\u1ecb \u0111\u01a1n h\u00e0ng c\u1ee7a h\u1ecd.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">C\u00e2u l\u1ec7nh c\u00f3 th\u1ec3 nh\u01b0 sau (gi\u1ea3 s\u1eed d\u00f9ng PostgreSQL):<\/span><\/p>\n<pre><span style=\"font-weight: 400;\">SELECT c.customer_id,<\/span>\n<span style=\"font-weight: 400;\"> \u00a0 \u00a0 \u00a0 c.customer_name,<\/span>\n<span style=\"font-weight: 400;\"> \u00a0 \u00a0 \u00a0 SUM(o.order_amount) AS total_spending<\/span>\n<span style=\"font-weight: 400;\">FROM customers c<\/span>\n<span style=\"font-weight: 400;\">JOIN orders o ON c.customer_id = o.customer_id<\/span>\n<span style=\"font-weight: 400;\">GROUP BY c.customer_id, c.customer_name<\/span>\n<span style=\"font-weight: 400;\">ORDER BY total_spending DESC;<\/span><\/pre>\n<p><span style=\"font-weight: 400;\">\u1ede \u0111\u00e2y, t\u00f4i d\u00f9ng INNER JOIN \u0111\u1ec3 l\u1ea5y nh\u1eefng kh\u00e1ch h\u00e0ng c\u00f3 \u00edt nh\u1ea5t m\u1ed9t \u0111\u01a1n h\u00e0ng. Sau \u0111\u00f3, t\u00f4i d\u00f9ng h\u00e0m t\u1ed5ng <\/span><span style=\"font-weight: 400;\">SUM<\/span><span style=\"font-weight: 400;\"> \u0111\u1ec3 t\u00ednh t\u1ed5ng chi ti\u00eau v\u00e0 GROUP BY \u0111\u1ec3 g\u1ed9p theo t\u1eebng kh\u00e1ch h\u00e0ng. \u0110i\u1ec1u n\u00e0y cho ph\u00e9p t\u00f4i bi\u1ebft \u0111\u01b0\u1ee3c ai l\u00e0 kh\u00e1ch h\u00e0ng chi tr\u1ea3 nhi\u1ec1u nh\u1ea5t.<\/span><\/p>\n<blockquote><p><em>\u0110\u1ecdc th\u00eam: <a href=\"https:\/\/itviec.com\/blog\/join-trong-sql\/\" target=\"_blank\" rel=\"noopener\"><strong>JOIN trong SQL: C\u00fa ph\u00e1p v\u00e0 c\u00e1ch s\u1eed d\u1ee5ng c\u00e1c ph\u00e9p JOIN<\/strong><\/a><\/em><\/p><\/blockquote>\n<h3><strong>L\u00e0m th\u1ebf n\u00e0o \u0111\u1ec3 t\u1ed1i \u01b0u h\u00f3a m\u1ed9t truy v\u1ea5n SQL ch\u1ea1y r\u1ea5t ch\u1eadm (ch\u1eb3ng h\u1ea1n m\u1ea5t \u0111\u1ebfn 10 gi\u00e2y \u0111\u1ec3 ho\u00e0n th\u00e0nh)? Li\u1ec7u vi\u1ec7c th\u00eam Index ho\u1eb7c s\u1eed d\u1ee5ng EXPLAIN PLAN (hay c\u00e1c ph\u01b0\u01a1ng ph\u00e1p kh\u00e1c) c\u00f3 gi\u00fap x\u00e1c \u0111\u1ecbnh v\u00e0 c\u1ea3i thi\u1ec7n \u0111i\u1ec3m ngh\u1ebdn hi\u1ec7u su\u1ea5t?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">\u0110\u1ea7u ti\u00ean, t\u00f4i s\u1ebd ki\u1ec3m tra truy v\u1ea5n v\u1edbi <\/span><span style=\"font-weight: 400;\">EXPLAIN<\/span><span style=\"font-weight: 400;\"> ho\u1eb7c <\/span><span style=\"font-weight: 400;\">EXPLAIN ANALYZE<\/span><span style=\"font-weight: 400;\"> (tu\u1ef3 h\u1ec7 qu\u1ea3n tr\u1ecb) \u0111\u1ec3 xem truy v\u1ea5n \u0111ang s\u1eed d\u1ee5ng lo\u1ea1i scan g\u00ec. N\u1ebfu truy v\u1ea5n \u0111ang s\u1eed d\u1ee5ng \u201csequential scan\u201d trong khi c\u1ed9t d\u00f9ng \u0111\u1ec3 JOIN ho\u1eb7c WHERE kh\u00f4ng \u0111\u01b0\u1ee3c \u0111\u00e1nh ch\u1ec9 m\u1ee5c (index), th\u00ec t\u00f4i s\u1ebd \u0111\u1ec1 xu\u1ea5t t\u1ea1o index ph\u00f9 h\u1ee3p.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i c\u0169ng s\u1ebd xem c\u00f3 \u201cbottleneck\u201d \u1edf ch\u1ed7 n\u00e0o kh\u00f4ng, v\u00ed d\u1ee5 do nhi\u1ec1u JOIN l\u1ed3ng nhau, GROUP BY tr\u00ean d\u1eef li\u1ec7u c\u1ef1c l\u1edbn. \u0110\u00f4i l\u00fac, ch\u1ec9 c\u1ea7n t\u00e1ch logic th\u00e0nh nhi\u1ec1u c\u00e2u truy v\u1ea5n trung gian ho\u1eb7c s\u1eed d\u1ee5ng <\/span><span style=\"font-weight: 400;\">CTE<\/span><span style=\"font-weight: 400;\"> (Common Table Expression) c\u0169ng gi\u00fap d\u1ec5 t\u1ed1i \u01b0u h\u01a1n.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ngo\u00e0i ra, tr\u00e1nh d\u00f9ng <\/span><a href=\"https:\/\/itviec.com\/blog\/select-trong-sql\/\" target=\"_blank\" rel=\"noopener\"><span style=\"font-weight: 400;\">SELECT *<\/span><\/a><span style=\"font-weight: 400;\"> n\u1ebfu kh\u00f4ng c\u1ea7n.<\/span><\/p>\n<h3><strong>Window functions trong SQL \u0111\u01b0\u1ee3c d\u00f9ng trong tr\u01b0\u1eddng h\u1ee3p n\u00e0o?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\"><strong>Window functions<\/strong> r\u1ea5t h\u1eefu \u00edch khi t\u00f4i c\u1ea7n th\u1ef1c hi\u1ec7n c\u00e1c ph\u00e9p t\u00ednh t\u1ed5ng (SUM, AVG, COUNT) ho\u1eb7c t\u00ednh to\u00e1n lu\u1ef9 ti\u1ebfn (running total) m\u00e0 kh\u00f4ng mu\u1ed1n nh\u00f3m d\u1eef li\u1ec7u ho\u00e0n to\u00e0n (nh\u01b0 GROUP BY). <\/span><span style=\"font-weight: 400;\">V\u00ed d\u1ee5, d\u00f9ng <\/span><span style=\"font-weight: 400;\">RANK()<\/span><span style=\"font-weight: 400;\"> hay <\/span><span style=\"font-weight: 400;\">ROW_NUMBER()<\/span><span style=\"font-weight: 400;\"> \u0111\u1ec3 x\u1ebfp h\u1ea1ng kh\u00e1ch h\u00e0ng d\u1ef1a tr\u00ean gi\u00e1 tr\u1ecb \u0111\u01a1n h\u00e0ng, ho\u1eb7c t\u00ednh t\u1ef7 l\u1ec7 ph\u1ea7n tr\u0103m (PERCENT_RANK).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Window functions gi\u00fap ph\u00e2n t\u00edch \u201ctheo nh\u00f3m\u201d (partition) nh\u01b0ng v\u1eabn gi\u1eef nguy\u00ean t\u00ednh ch\u1ea5t t\u1eebng d\u00f2ng, r\u1ea5t h\u1eefu \u00edch khi x\u00e2y d\u1ef1ng b\u00e1o c\u00e1o t\u00e0i ch\u00ednh tu\u1ea7n t\u1ef1, ho\u1eb7c so s\u00e1nh d\u1eef li\u1ec7u th\u00e1ng hi\u1ec7n t\u1ea1i v\u1edbi th\u00e1ng li\u1ec1n tr\u01b0\u1edbc.<\/span><\/p>\n<h3><strong>Khi n\u00e0o b\u1ea1n s\u1eed d\u1ee5ng Pivot Table?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Pivot Table l\u00e0 c\u00f4ng c\u1ee5 tuy\u1ec7t v\u1eddi khi t\u00f4i c\u1ea7n t\u1ed5ng h\u1ee3p d\u1eef li\u1ec7u nhanh, xem t\u1ef7 l\u1ec7, xem t\u1ed5ng doanh thu theo th\u00e1ng ho\u1eb7c theo khu v\u1ef1c m\u1ed9t c\u00e1ch linh ho\u1ea1t.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ch\u1eb3ng h\u1ea1n, n\u1ebfu t\u00f4i c\u00f3 b\u1ea3ng giao d\u1ecbch 10.000 d\u00f2ng, thay v\u00ec code Python, t\u00f4i ch\u1ec9 c\u1ea7n v\u00e0i gi\u00e2y k\u00e9o-th\u1ea3 tr\u01b0\u1eddng (field) trong Pivot Table \u0111\u1ec3 thay \u0111\u1ed5i g\u00f3c nh\u00ecn d\u1eef li\u1ec7u (theo Ng\u00e0y, theo S\u1ea3n ph\u1ea9m, theo \u0110\u1ecba ph\u01b0\u01a1ng). \u0110i\u1ec1u n\u00e0y c\u1ef1c k\u1ef3 h\u1eefu \u00edch trong bu\u1ed5i h\u1ecdp khi stakeholder mu\u1ed1n ki\u1ec3m tra nhanh m\u1ed9t ch\u1ec9 s\u1ed1.<\/span><\/p>\n<h3><strong>Khi c\u1ea7n d\u00f2 t\u00ecm gi\u00e1 tr\u1ecb th\u00ec b\u1ea1n s\u1eed d\u1ee5ng INDEX\/MATCH hay VLOOKUP? L\u00fd do v\u00ec sao?<\/strong><\/h3>\n<table>\n<tbody>\n<tr>\n<td><b>Ti\u00eau ch\u00ed<\/b><\/td>\n<td><b>VLOOKUP<\/b><\/td>\n<td><b>INDEX\/MATCH<\/b><\/td>\n<\/tr>\n<tr>\n<td><span style=\"font-weight: 400;\">C\u00e1ch th\u1ee9c ho\u1ea1t \u0111\u1ed9ng<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; T\u00ecm ki\u1ebfm m\u1ed9t gi\u00e1 tr\u1ecb trong c\u1ed9t \u0111\u1ea7u ti\u00ean c\u1ee7a b\u1ea3ng. &lt;br\/&gt; &#8211; Tr\u1ea3 v\u1ec1 gi\u00e1 tr\u1ecb t\u1eeb m\u1ed9t c\u1ed9t \u0111\u01b0\u1ee3c ch\u1ec9 \u0111\u1ecbnh.<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; K\u1ebft h\u1ee3p hai h\u00e0m: &lt;br\/&gt; MATCH t\u00ecm v\u1ecb tr\u00ed c\u1ee7a gi\u00e1 tr\u1ecb trong m\u1ed9t m\u1ea3ng\/d\u00f2ng\/c\u1ed9t.&lt;br\/&gt; INDEX tr\u1ea3 v\u1ec1 gi\u00e1 tr\u1ecb t\u1ea1i v\u1ecb tr\u00ed \u0111\u00f3 trong v\u00f9ng d\u1eef li\u1ec7u.<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"font-weight: 400;\">C\u00fa ph\u00e1p c\u01a1 b\u1ea3n<\/span><\/td>\n<td><span style=\"font-weight: 400;\">VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; MATCH(lookup_value, lookup_array, [match_type]) &lt;br\/&gt; &#8211; INDEX(array, row_num, [column_num])<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"font-weight: 400;\">\u01afu \u0111i\u1ec3m<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; D\u1ec5 h\u1ecdc, d\u1ec5 d\u00f9ng \u0111\u1ed1i v\u1edbi ng\u01b0\u1eddi m\u1edbi v\u00ec ch\u1ec9 c\u1ea7n m\u1ed9t h\u00e0m duy nh\u1ea5t. &lt;br\/&gt; &#8211; Ph\u1ed5 bi\u1ebfn, nhi\u1ec1u ng\u01b0\u1eddi d\u00f9ng Excel \u0111\u1ec1u bi\u1ebft.<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; Linh ho\u1ea1t, kh\u00f4ng b\u1eaft bu\u1ed9c c\u1ed9t d\u00f2 t\u00ecm ph\u1ea3i n\u1eb1m \u1edf v\u1ecb tr\u00ed \u0111\u1ea7u ti\u00ean. &lt;br\/&gt; &#8211; Khi c\u1ea5u tr\u00fac b\u1ea3ng thay \u0111\u1ed5i (th\u00eam\/xo\u00e1 c\u1ed9t), c\u00f4ng th\u1ee9c kh\u00f4ng b\u1ecb \u1ea3nh h\u01b0\u1edfng nhi\u1ec1u. &lt;br\/&gt; &#8211; Th\u01b0\u1eddng cho hi\u1ec7u su\u1ea5t t\u1ed1t h\u01a1n tr\u00ean b\u1ea3ng d\u1eef li\u1ec7u l\u1edbn.<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"font-weight: 400;\">Nh\u01b0\u1ee3c \u0111i\u1ec3m<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; B\u1eaft bu\u1ed9c c\u1ed9t ch\u1ee9a gi\u00e1 tr\u1ecb d\u00f2 ph\u1ea3i l\u00e0 c\u1ed9t \u0111\u1ea7u ti\u00ean. &lt;br\/&gt; &#8211; D\u1ec5 g\u00e2y l\u1ed7i khi c\u1ea5u tr\u00fac b\u1ea3ng thay \u0111\u1ed5i (th\u00eam\/xo\u00e1 c\u1ed9t thay \u0111\u1ed5i th\u1ee9 t\u1ef1).<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; S\u1eed d\u1ee5ng k\u1ebft h\u1ee3p hai h\u00e0m, d\u1ec5 g\u00e2y kh\u00f3 hi\u1ec3u v\u1edbi ng\u01b0\u1eddi \u00edt kinh nghi\u1ec7m. &lt;br\/&gt; &#8211; Vi\u1ec7c vi\u1ebft c\u00f4ng th\u1ee9c d\u00e0i h\u01a1n (2 h\u00e0m) so v\u1edbi VLOOKUP (1 h\u00e0m).<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"font-weight: 400;\">\u0110\u1ed9 kh\u00f3 ti\u1ebfp c\u1eadn<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; D\u1ec5 ti\u1ebfp c\u1eadn cho ng\u01b0\u1eddi m\u1edbi h\u1ecdc Excel ho\u1eb7c m\u1edbi l\u00e0m quen h\u00e0m tra c\u1ee9u.<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; M\u1ee9c \u0111\u1ed9 trung b\u00ecnh, c\u1ea7n n\u1eafm v\u1eefng c\u00e1ch v\u1eadn h\u00e0nh c\u1ee7a c\u1ea3 MATCH l\u1eabn INDEX.<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"font-weight: 400;\">T\u1ed1c \u0111\u1ed9\/Hi\u1ec7u su\u1ea5t<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; V\u1edbi b\u1ea3ng d\u1eef li\u1ec7u l\u1edbn, VLOOKUP c\u00f3 th\u1ec3 ch\u1eadm h\u01a1n v\u00ec m\u1ed7i l\u1ea7n h\u00e0m c\u1ea7n duy\u1ec7t nhi\u1ec1u c\u1ed9t.<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; Th\u01b0\u1eddng nhanh h\u01a1n v\u00e0 linh ho\u1ea1t h\u01a1n tr\u00ean b\u1ea3ng d\u1eef li\u1ec7u l\u1edbn. &lt;br\/&gt; &#8211; T\u1ed1i \u01b0u khi c\u1ea7n tra c\u1ee9u \u1edf nhi\u1ec1u v\u1ecb tr\u00ed kh\u00e1c nhau.<\/span><\/td>\n<\/tr>\n<tr>\n<td><span style=\"font-weight: 400;\">Tr\u01b0\u1eddng h\u1ee3p s\u1eed d\u1ee5ng \u0111i\u1ec3n h\u00ecnh<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; D\u1eef li\u1ec7u s\u1eafp x\u1ebfp theo ki\u1ec3u \u201clookup\u201d c\u1ed1 \u0111\u1ecbnh, \u00edt thay \u0111\u1ed5i c\u1ea5u tr\u00fac c\u1ed9t. &lt;br\/&gt; &#8211; Nh\u00f3m l\u00e0m vi\u1ec7c quen VLOOKUP, mu\u1ed1n nhanh ch\u00f3ng \u00e1p d\u1ee5ng m\u00e0 kh\u00f4ng c\u1ea7n \u0111\u00e0o t\u1ea1o nhi\u1ec1u.<\/span><\/td>\n<td><span style=\"font-weight: 400;\">&#8211; D\u1eef li\u1ec7u c\u00f3 th\u1ec3 thay \u0111\u1ed5i c\u1ea5u tr\u00fac c\u1ed9t. &lt;br\/&gt; &#8211; C\u1ea7n truy xu\u1ea5t gi\u00e1 tr\u1ecb \u1edf nhi\u1ec1u c\u1ed9t kh\u00e1c nhau, ho\u1eb7c duy\u1ec7t nhi\u1ec1u chi\u1ec1u. &lt;br\/&gt; &#8211; B\u1ea3ng d\u1eef li\u1ec7u l\u1edbn y\u00eau c\u1ea7u t\u1ed1c \u0111\u1ed9 tra c\u1ee9u nhanh.<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3><strong>B\u1ea1n \u0111\u00e3 bao gi\u1edd d\u00f9ng Data Validation \u0111\u1ec3 ki\u1ec3m so\u00e1t input trong Excel ch\u01b0a?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">C\u00f3.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Data Validation cho ph\u00e9p t\u00f4i g\u00e1n c\u00e1c quy t\u1eafc, v\u00ed d\u1ee5 nh\u01b0 ch\u1ec9 cho ph\u00e9p ng\u01b0\u1eddi d\u00f9ng nh\u1eadp ng\u00e0y th\u00e1ng trong m\u1ed9t kho\u1ea3ng nh\u1ea5t \u0111\u1ecbnh, ho\u1eb7c ch\u1ec9 ch\u1ecdn gi\u00e1 tr\u1ecb t\u1eeb danh s\u00e1ch. C\u00e1ch n\u00e0y gi\u00fap d\u1eef li\u1ec7u \u201cs\u1ea1ch\u201d ngay t\u1eeb khi nh\u1eadp, gi\u1ea3m r\u1ee7i ro sai s\u00f3t do l\u1ed7i g\u00f5. Th\u1eadm ch\u00ed, t\u00f4i c\u00f2n b\u1eadt c\u1ea3 th\u00f4ng b\u00e1o c\u1ea3nh b\u00e1o khi ng\u01b0\u1eddi nh\u1eadp sai lo\u1ea1i d\u1eef li\u1ec7u.<\/span><\/p>\n<h3><strong>B\u1ea1n th\u01b0\u1eddng x\u1eed l\u00fd gi\u00e1 tr\u1ecb thi\u1ebfu (missing) b\u1eb1ng c\u00e1ch n\u00e0o (mean imputation, median, x\u00f3a d\u00f2ng)?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Ph\u01b0\u01a1ng ph\u00e1p x\u1eed l\u00fd ph\u1ee5 thu\u1ed9c v\u00e0o \u0111\u1eb7c th\u00f9 t\u1eadp d\u1eef li\u1ec7u, v\u00ed d\u1ee5:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">N\u1ebfu d\u1eef li\u1ec7u thi\u1ebfu ch\u1ec9 chi\u1ebfm t\u1ef7 l\u1ec7 nh\u1ecf (v\u00ed d\u1ee5 &lt;5%), x\u00f3a d\u00f2ng c\u00f3 th\u1ec3 t\u1ea1m ch\u1ea5p nh\u1eadn.<\/span><\/li>\n<li><span style=\"font-weight: 400;\">C\u00f2n n\u1ebfu b\u1ecb thi\u1ebfu \u1edf c\u1ed9t quan tr\u1ecdng, t\u00f4i \u01b0u ti\u00ean t\u00ednh to\u00e1n c\u00e1c \u201c\u0111i\u1ec3m thay th\u1ebf\u201d nh\u01b0 mean, median ho\u1eb7c d\u00f9ng m\u00f4 h\u00ecnh d\u1ef1 \u0111o\u00e1n (ch\u1eb3ng h\u1ea1n <\/span><span style=\"font-weight: 400;\">KNN imputer<\/span><span style=\"font-weight: 400;\"> trong Python) \u0111\u1ec3 \u01b0\u1edbc l\u01b0\u1ee3ng.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Tuy nhi\u00ean, t\u00f4i lu\u00f4n ki\u1ec3m tra li\u1ec7u vi\u1ec7c thay th\u1ebf \u1ea5y c\u00f3 l\u00e0m \u201csai l\u1ec7ch\u201d ph\u00e2n ph\u1ed1i d\u1eef li\u1ec7u hay kh\u00f4ng.<\/span><\/p>\n<h3><strong>C\u00e1ch b\u1ea1n ph\u00e1t hi\u1ec7n v\u00e0 x\u1eed l\u00fd outlier?<\/strong><\/h3>\n<p><strong>Outlier<\/strong> l\u00e0 m\u1ed9t s\u1ed1 c\u00e1c \u0111i\u1ec3m d\u1eef li\u1ec7u r\u1ea5t b\u1ea5t th\u01b0\u1eddng so v\u1edbi t\u1eadp d\u1eef li\u1ec7u, n\u1eb1m xa gi\u00e1 tr\u1ecb trung b\u00ecnh c\u1ee7a t\u1eadp, kh\u00f4ng tu\u00e2n theo c\u00e1c quy lu\u1eadt ph\u00e2n ph\u1ed1i chu\u1ea9n.<\/p>\n<p><span style=\"font-weight: 400;\">\u0110\u1ec3 ph\u00e1t hi\u1ec7u outlier, t\u00f4i th\u01b0\u1eddng v\u1ebd c\u00e1c bi\u1ec3u \u0111\u1ed3 nh\u01b0 boxplot, histogram \u0111\u1ec3 ph\u00e1t hi\u1ec7n outlier m\u1ed9t c\u00e1ch tr\u1ef1c quan. <\/span><span style=\"font-weight: 400;\">V\u1ec1 m\u1eb7t \u0111\u1ecbnh l\u01b0\u1ee3ng, IQR (Interquartile Range) l\u00e0 m\u1ed9t ch\u1ec9 s\u1ed1 th\u01b0\u1eddng d\u00f9ng: nh\u1eefng gi\u00e1 tr\u1ecb n\u1eb1m ngo\u00e0i [Q1 &#8211; 1.5<\/span><i><span style=\"font-weight: 400;\">IQR, Q3 + 1.5<\/span><\/i><span style=\"font-weight: 400;\">IQR] c\u00f3 th\u1ec3 coi l\u00e0 outlier.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ngo\u00e0i ra, t\u00f4i d\u00f9ng z-score n\u1ebfu d\u1eef li\u1ec7u g\u1ea7n ph\u00e2n ph\u1ed1i chu\u1ea9n. Vi\u1ec7c x\u1eed l\u00fd outlier c\u00f3 th\u1ec3 l\u00e0 lo\u1ea1i b\u1ecf (khi outlier l\u00e0 l\u1ed7i) ho\u1eb7c winsorizing (g\u00e1n b\u1eb1ng gi\u00e1 tr\u1ecb ng\u01b0\u1ee1ng).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Quan tr\u1ecdng l\u00e0 t\u00f4i ph\u1ea3i \u0111\u1eb7t c\u00e2u h\u1ecfi li\u1ec7u outlier \u0111\u00f3 c\u00f3 \u00fd ngh\u0129a v\u1ec1 m\u1eb7t kinh doanh hay kh\u00f4ng (v\u00ed d\u1ee5, VIP user chi ti\u1ec1n c\u1ef1c l\u1edbn kh\u00f4ng ph\u1ea3i \u201cl\u1ed7i\u201d m\u00e0 l\u00e0 t\u1ec7p kh\u00e1ch h\u00e0ng \u0111\u1eb7c bi\u1ec7t).<\/span><\/p>\n<h3><strong>B\u1ea1n h\u1ee3p nh\u1ea5t nhi\u1ec1u file CSV th\u00e0nh m\u1ed9t dataset l\u1edbn nh\u01b0 th\u1ebf n\u00e0o?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">T\u00f4i th\u01b0\u1eddng s\u1eed d\u1ee5ng <\/span><strong>pandas.concat()<\/strong><span style=\"font-weight: 400;\"> ho\u1eb7c <\/span><strong>pd.merge()<\/strong><span style=\"font-weight: 400;\"> khi mu\u1ed1n gh\u00e9p theo c\u1ed9t chung.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Tr\u01b0\u1edbc khi h\u1ee3p nh\u1ea5t, t\u00f4i ki\u1ec3m tra k\u1ef9 xem t\u00ean c\u1ed9t c\u00f3 nh\u1ea5t qu\u00e1n kh\u00f4ng, encode k\u00fd t\u1ef1 th\u1ebf n\u00e0o (UTF-8 hay ANSI), c\u00f3 c\u1ed9t n\u00e0o tr\u00f9ng ho\u1eb7c b\u1ecb l\u1eb7p gi\u00e1 tr\u1ecb. T\u00f4i c\u0169ng xem x\u00e9t dung l\u01b0\u1ee3ng RAM v\u00e0 c\u00f3 th\u1ec3 chuy\u1ec3n sang \u0111\u1ecdc d\u1eef li\u1ec7u theo chunks n\u1ebfu file qu\u00e1 l\u1edbn.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Cu\u1ed1i c\u00f9ng, t\u00f4i g\u1eafn c\u1edd (flag) ho\u1eb7c th\u00eam c\u1ed9t ch\u1ec9 \u0111\u1ecbnh \u201cngu\u1ed3n\u201d (source) \u0111\u1ec3 sau n\u00e0y ph\u00e2n t\u00edch d\u1eef li\u1ec7u cho d\u1ec5 \u0111\u1ed1i chi\u1ebfu.<\/span><\/p>\n<h3><strong>B\u1ea1n t\u1eebng ch\u1ea1y query tr\u00ean BigQuery\/AWS Redshift ch\u01b0a? So s\u00e1nh BigQuery v\u1edbi SQL truy\u1ec1n th\u1ed1ng?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">T\u00f4i \u0111\u00e3 d\u00f9ng BigQuery cho c\u00e1c d\u1ef1 \u00e1n ph\u00e2n t\u00edch d\u1eef li\u1ec7u clickstream.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">BigQuery l\u00e0 d\u1ea1ng \u201cserverless\u201d, kh\u00f4ng c\u1ea7n lo nhi\u1ec1u v\u1ec1 c\u1ea5u h\u00ecnh m\u00e1y ch\u1ee7, tr\u1ea3 ph\u00ed theo kh\u1ed1i l\u01b0\u1ee3ng d\u1eef li\u1ec7u \u0111\u01b0\u1ee3c qu\u00e9t. AWS Redshift l\u1ea1i theo h\u01b0\u1edbng MPP (Massively Parallel Processing), khi c\u1ea7n x\u1eed l\u00fd data warehouse \u1edf quy m\u00f4 l\u1edbn. C\u1ea3 hai \u0111\u1ec1u t\u01b0\u01a1ng th\u00edch ng\u00f4n ng\u1eef SQL, nh\u01b0ng khi ch\u1ea1y BigQuery, t\u00f4i th\u01b0\u1eddng c\u1ea9n th\u1eadn l\u1ecdc b\u1edbt c\u1ed9t, tr\u00e1nh <\/span><span style=\"font-weight: 400;\">SELECT *<\/span><span style=\"font-weight: 400;\"> v\u00ec ph\u00ed t\u00ednh d\u1ef1a tr\u00ean l\u01b0\u1ee3ng d\u1eef li\u1ec7u qu\u00e9t.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">\u1ede g\u00f3c \u0111\u1ed9 truy\u1ec1n th\u1ed1ng, t\u00f4i ph\u1ea3i t\u1ef1 t\u1ed1i \u01b0u nhi\u1ec1u h\u01a1n v\u1ec1 m\u1eb7t c\u00e0i \u0111\u1eb7t (\u0111\u1eb7t index, ph\u00e2n m\u1ea3nh b\u1ea3ng\u2026) trong nh\u1eefng h\u1ec7 th\u1ed1ng nh\u01b0 MySQL hay PostgreSQL.<\/span><\/p>\n<h3><strong>B\u1ea1n th\u00edch d\u00f9ng Jupyter Notebook hay VSCode \u0111\u1ec3 ph\u00e2n t\u00edch d\u1eef li\u1ec7u? T\u1ea1i sao?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">M\u1ed7i c\u00f4ng c\u1ee5 c\u00f3 \u01b0u \u0111i\u1ec3m ri\u00eang, nh\u01b0:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\"><strong>Jupyter Notebook<\/strong> cho ph\u00e9p t\u00f4i ghi ch\u00fa (markdown) v\u00e0 hi\u1ec3n th\u1ecb bi\u1ec3u \u0111\u1ed3 ngay d\u01b0\u1edbi cell code, r\u1ea5t th\u00e2n thi\u1ec7n khi t\u00f4i demo m\u00f4 h\u00ecnh ho\u1eb7c chia s\u1ebb v\u1edbi \u0111\u1ed3ng nghi\u1ec7p phi k\u1ef9 thu\u1eadt.<\/span><\/li>\n<li><span style=\"font-weight: 400;\"><strong>VSCode<\/strong> th\u00ec m\u1ea1nh \u1edf kh\u1ea3 n\u0103ng qu\u1ea3n l\u00fd d\u1ef1 \u00e1n l\u1edbn, t\u00edch h\u1ee3p Git, debugging, nhi\u1ec1u extension (ESLint, Python lint\u2026).<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">T\u00f9y v\u00e0o t\u00ednh ch\u1ea5t c\u1ee7a c\u00f4ng vi\u1ec7c m\u00e0 t\u00f4i ch\u1ecdn c\u00f4ng c\u1ee5:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">N\u1ebfu l\u00e0m quick EDA hay prototyping, t\u00f4i nghi\u00eang v\u1ec1 Jupyter Notebook.<\/span><\/li>\n<li><span style=\"font-weight: 400;\">C\u00f2n khi code \u201cproduction\u201d hay c\u1ea7n c\u1ea5u tr\u00fac d\u1ef1 \u00e1n r\u00f5 r\u00e0ng, t\u00f4i d\u00f9ng VSCode.<\/span><\/li>\n<\/ul>\n<h3><strong>Khi n\u00e0o n\u00ean d\u00f9ng bi\u1ec3u \u0111\u1ed3 c\u1ed9t, \u0111\u01b0\u1eddng, hay scatter plot?<\/strong><\/h3>\n<p>T\u00f4i ch\u1ecdn d\u1ea1ng bi\u1ec3u \u0111\u1ed3 tu\u1ef3 m\u1ee5c ti\u00eau tr\u00ecnh b\u00e0y v\u00e0 ki\u1ec3u d\u1eef li\u1ec7u:<\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Bi\u1ec3u \u0111\u1ed3 c\u1ed9t (bar chart) d\u00f9ng \u0111\u1ec3 so s\u00e1nh danh m\u1ee5c (v\u00ed d\u1ee5: top 5 s\u1ea3n ph\u1ea9m b\u00e1n ch\u1ea1y).<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Bi\u1ec3u \u0111\u1ed3 \u0111\u01b0\u1eddng (line chart) ph\u00f9 h\u1ee3p \u0111\u1ec3 theo d\u00f5i xu h\u01b0\u1edbng theo chu\u1ed7i th\u1eddi gian (doanh thu theo ng\u00e0y).<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">Scatter plot (bi\u1ec3u \u0111\u1ed3 ph\u00e2n t\u00e1n) gi\u00fap xem m\u1ed1i t\u01b0\u01a1ng quan gi\u1eefa hai bi\u1ebfn, ch\u1eb3ng h\u1ea1n gi\u1eefa \u201cgi\u00e1 tr\u1ecb \u0111\u01a1n h\u00e0ng\u201d v\u00e0 \u201cs\u1ed1 l\u1ea7n \u0111\u0103ng nh\u1eadp\u201d.<\/span><\/li>\n<\/ul>\n<h3><strong>B\u1ea1n c\u00f3 kinh nghi\u1ec7m thi\u1ebft k\u1ebf dashboard cho c\u1ea5p l\u00e3nh \u0111\u1ea1o tr\u00ean Tableau\/Power BI? B\u00ed quy\u1ebft l\u00e0m dashboard \u201cd\u1ec5 hi\u1ec3u\u201d?<\/strong><\/h3>\n<p><b>G\u00f3c nh\u00ecn nh\u00e0 tuy\u1ec3n d\u1ee5ng:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Doanh nghi\u1ec7p mong \u1ee9ng vi\u00ean c\u00f3 kh\u1ea3 n\u0103ng \u201cstorytelling\u201d, thi\u1ebft k\u1ebf dashboard kh\u00f4ng ch\u1ec9 \u0111\u1eb9p m\u00e0 c\u00f2n ti\u1ebft ki\u1ec7m th\u1eddi gian cho l\u00e3nh \u0111\u1ea1o. H\u1ecd \u0111\u00e1nh gi\u00e1 cao \u1ee9ng vi\u00ean c\u00f3 t\u01b0 duy t\u1ed1i gi\u1ea3n v\u00e0 \u01b0u ti\u00ean t\u00ednh \u201cactionable insight.\u201d<\/span><\/p>\n<p><b>C\u00e2u tr\u1ea3 l\u1eddi g\u1ee3i \u00fd:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i th\u01b0\u1eddng b\u1eaft \u0111\u1ea7u b\u1eb1ng vi\u1ec7c h\u1ecfi s\u1ebfp\/manager: \u201cAnh\/ch\u1ecb mu\u1ed1n nh\u00ecn th\u1ea5y d\u1eef li\u1ec7u g\u00ec ngay khi m\u1edf dashboard?\u201d Th\u00f4ng th\u01b0\u1eddng, l\u00e3nh \u0111\u1ea1o mu\u1ed1n xem KPI quan tr\u1ecdng (doanh thu, l\u1ee3i nhu\u1eadn, user m\u1edbi\u2026), k\u00e8m xu h\u01b0\u1edbng.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Do \u0111\u00f3, t\u00f4i \u0111\u1eb7t c\u00e1c ch\u1ec9 s\u1ed1 ch\u00ednh \u1edf top, d\u00f9ng gam m\u00e0u trung t\u00ednh, highlight c\u1ed9t hay \u0111\u01b0\u1eddng bi\u1ec3u \u0111\u1ed3 khi c\u1ea7n t\u1eadp trung. T\u00f4i c\u0169ng h\u1ea1n ch\u1ebf nh\u1ed3i nh\u00e9t qu\u00e1 nhi\u1ec1u bi\u1ec3u \u0111\u1ed3, thay v\u00e0o \u0111\u00f3 chia th\u00e0nh c\u00e1c tab\/dash nh\u1ecf g\u1ecdn.<\/span><\/p>\n<h3><strong>B\u1ea1n th\u01b0\u1eddng ch\u1ecdn KPI n\u00e0o v\u00e0 s\u1eafp x\u1ebfp b\u1ed1 c\u1ee5c dashboard ra sao \u0111\u1ec3 truy\u1ec1n \u0111\u1ea1t insight nhanh nh\u1ea5t?<\/strong><\/h3>\n<p><b>G\u00f3c nh\u00ecn nh\u00e0 tuy\u1ec3n d\u1ee5ng:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">K\u1ef9 n\u0103ng thi\u1ebft k\u1ebf dashboard v\u00e0 l\u1ef1a ch\u1ecdn KPI ph\u00f9 h\u1ee3p l\u00e0 c\u1ef1c k\u1ef3 quan tr\u1ecdng, nh\u1ea5t l\u00e0 v\u1edbi Data Analyst trong m\u00f4i tr\u01b0\u1eddng c\u00f4ng ngh\u1ec7 cao. Doanh nghi\u1ec7p mu\u1ed1n \u0111\u00e1nh gi\u00e1 li\u1ec7u b\u1ea1n c\u00f3 kh\u1ea3 n\u0103ng x\u00e1c \u0111\u1ecbnh ch\u1ec9 s\u1ed1 c\u1ed1t l\u00f5i, t\u1ed5 ch\u1ee9c b\u1ed1 c\u1ee5c h\u1ee3p l\u00fd v\u00e0 \u0111\u01b0a ra insights h\u1eefu \u00edch.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Kh\u1ea3 n\u0103ng n\u00e0y \u0111\u00f2i h\u1ecfi t\u01b0 duy t\u1ed5ng h\u1ee3p v\u1ec1 h\u1ea1 t\u1ea7ng d\u1eef li\u1ec7u, k\u1ef9 n\u0103ng ph\u00e2n t\u00edch, v\u00e0 c\u00e1ch giao ti\u1ebfp b\u1eb1ng h\u00ecnh \u1ea3nh (data visualization) sao cho r\u00f5 r\u00e0ng, d\u1ec5 hi\u1ec3u.<\/span><\/p>\n<p><b>C\u00e2u tr\u1ea3 l\u1eddi g\u1ee3i \u00fd:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i ch\u1ecdn KPI d\u1ef1a tr\u00ean m\u1ee5c ti\u00eau kinh doanh: n\u1ebfu m\u1ee5c ti\u00eau l\u00e0 t\u0103ng chuy\u1ec3n \u0111\u1ed5i, th\u00ec dashboard s\u1ebd c\u00f3 c\u00e1c ch\u1ec9 s\u1ed1 nh\u01b0 Conversion Rate, Revenue per Visit, v.v.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">B\u1ed1 c\u1ee5c th\u01b0\u1eddng theo th\u1ee9 t\u1ef1:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">(1) KPI c\u1ed1t l\u00f5i \u1edf tr\u00ean c\u00f9ng;<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(2) Xu h\u01b0\u1edbng chung (line chart);<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(3) Chi ti\u1ebft theo ph\u00e2n kh\u00fac hay danh m\u1ee5c (bar chart, table)<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(4) Th\u00eam ph\u1ea7n \u201cKey Insights\/Rfaecommendation\u201d \u1edf g\u00f3c \u0111\u1ec3 ghi ch\u00fa c\u00e1c \u0111i\u1ec3m \u0111\u1eb7c bi\u1ec7t<\/span><\/li>\n<\/ul>\n<h3><strong>L\u00e0m th\u1ebf n\u00e0o \u0111\u1ec3 b\u1ea1n chuy\u1ec3n m\u1ee5c ti\u00eau kinh doanh chung chung (v\u00ed d\u1ee5: \u2018T\u0103ng doanh thu 20% trong qu\u00fd t\u1edbi\u2019) th\u00e0nh c\u00e1c metric c\u1ee5 th\u1ec3, k\u00e8m v\u00ed d\u1ee5 minh ho\u1ea1?<\/strong><\/h3>\n<p><b>G\u00f3c nh\u00ecn nh\u00e0 tuy\u1ec3n d\u1ee5ng:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">K\u1ef9 n\u0103ng thi\u1ebft k\u1ebf v\u00e0 qu\u1ea3n l\u00fd metric l\u00e0 m\u1ed9t trong nh\u1eefng y\u1ebfu t\u1ed1 c\u1ed1t l\u00f5i c\u1ee7a Data Analyst. Doanh nghi\u1ec7p mu\u1ed1n th\u1ea5y b\u1ea1n bi\u1ebft c\u00e1ch chuy\u1ec3n m\u1ee5c ti\u00eau kinh doanh t\u1ed5ng th\u1ec3 th\u00e0nh nh\u1eefng ch\u1ec9 s\u1ed1 \u0111o l\u01b0\u1eddng c\u1ee5 th\u1ec3, c\u00f3 th\u1ec3 theo d\u00f5i b\u1eb1ng h\u1ea1 t\u1ea7ng d\u1eef li\u1ec7u ph\u00f9 h\u1ee3p, t\u1eeb CRM \u0111\u1ebfn c\u00f4ng c\u1ee5 ph\u00e2n t\u00edch nh\u01b0 Google Analytics.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Vi\u1ec7c n\u00e0y \u0111\u00f2i h\u1ecfi t\u01b0 duy t\u1ed5 ch\u1ee9c d\u1eef li\u1ec7u, kh\u1ea3 n\u0103ng ph\u00e2n t\u00edch, v\u00e0 k\u1ef9 n\u0103ng x\u1eed l\u00fd trong c\u00e1c m\u00f4i tr\u01b0\u1eddng c\u00f4ng ngh\u1ec7 cao.<\/span><\/p>\n<p><strong>C\u00e2u tr\u1ea3 l\u1eddi g\u1ee3i \u00fd:<\/strong><\/p>\n<p><b>B\u01b0\u1edbc 1: <\/b>Ph\u00e2n r\u00e3 c\u00e1c y\u1ebfu t\u1ed1 ch\u00ednh<span style=\"font-weight: 400;\"> (drivers):<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><span style=\"font-weight: 400;\">S\u1ed1 \u0111\u01a1n h\u00e0ng (Number of Orders)<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><span style=\"font-weight: 400;\">Gi\u00e1 tr\u1ecb trung b\u00ecnh \u0111\u01a1n h\u00e0ng (AOV)<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><span style=\"font-weight: 400;\">T\u1ea7n su\u1ea5t mua l\u1ea1i (Purchase Frequency)<\/span><\/li>\n<\/ul>\n<p><b>B\u01b0\u1edbc 2: <\/b>X\u00e1c \u0111\u1ecbnh metric &amp; \u0111\u1eb7t target<span style=\"font-weight: 400;\"> cho t\u1eebng y\u1ebfu t\u1ed1:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><span style=\"font-weight: 400;\">#Orders: +15% so v\u1edbi qu\u00fd tr\u01b0\u1edbc<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><span style=\"font-weight: 400;\">AOV: t\u0103ng t\u1eeb 500.000 l\u00ean 600.000 \u0111\u1ed3ng<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"2\"><span style=\"font-weight: 400;\">T\u1ea7n su\u1ea5t mua l\u1ea1i: t\u1eeb 20% l\u00ean 25%<\/span><\/li>\n<\/ul>\n<p><b>B\u01b0\u1edbc 3: <\/b>Th\u1ed1ng nh\u1ea5t c\u00e1ch \u0111o l\u01b0\u1eddng, ngu\u1ed3n d\u1eef li\u1ec7u, t\u1ea7n su\u1ea5t b\u00e1o c\u00e1o<span style=\"font-weight: 400;\"> (VD: l\u1ea5y t\u1eeb CRM, Google Analytics\u2026).<\/span><\/p>\n<p><b>B\u01b0\u1edbc 4: <\/b>Theo d\u00f5i th\u01b0\u1eddng xuy\u00ean<span style=\"font-weight: 400;\"> v\u00e0 <\/span>\u0111i\u1ec1u ch\u1ec9nh<span style=\"font-weight: 400;\"> chi\u1ebfn l\u01b0\u1ee3c n\u1ebfu th\u1ea5y k\u1ebft qu\u1ea3 kh\u00f4ng \u0111\u1ea1t k\u1ef3 v\u1ecdng.<\/span><\/p>\n<h3><strong>B\u1ea1n t\u1eebng \u0111\u1ec1 xu\u1ea5t m\u1ed9t metric m\u1edbi \u0111\u1ec3 \u0111o l\u01b0\u1eddng hi\u1ec7u su\u1ea5t s\u1ea3n ph\u1ea9m ch\u01b0a? Quy tr\u00ecnh x\u00e2y d\u1ef1ng ra sao?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Tr\u01b0\u1edbc \u0111\u00e2y, c\u00f4ng ty mu\u1ed1n theo d\u00f5i m\u1ee9c \u0111\u1ed9 \u201cch\u1ee7 \u0111\u1ed9ng\u201d c\u1ee7a user (active user).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u1eeb y\u00eau c\u1ea7u \u0111\u00f3, t\u00f4i \u0111\u1ec1 xu\u1ea5t m\u1ed9t metric m\u1edbi l\u00e0 \u201cActive Session Rate\u201d v\u1edbi \u0111\u1ecbnh ngh\u0129a bao g\u1ed3m S\u1ed1 phi\u00ean \u0111\u0103ng nh\u1eadp &gt;= 2 ph\u00fat \/ t\u1ed5ng user.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">\u0110\u1ec3 x\u00e2y d\u1ef1ng metric n\u00e0y, t\u00f4i trao \u0111\u1ed5i v\u1edbi team k\u1ef9 thu\u1eadt \u0111\u1ec3 ch\u1eafc ch\u1eafn log th\u1eddi gian ch\u00ednh x\u00e1c, r\u1ed3i t\u00ednh trung b\u00ecnh h\u00e0ng ng\u00e0y (DAU), h\u00e0ng tu\u1ea7n (WAU).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Sau \u0111\u00f3, t\u00f4i A\/B Testing xem metric n\u00e0y c\u00f3 ph\u1ea3n \u00e1nh trung th\u1ef1c \u201cuser g\u1eafn b\u00f3\u201d hay kh\u00f4ng. K\u1ebft qu\u1ea3 cho th\u1ea5y n\u00f3 d\u1ef1 \u0111o\u00e1n kh\u00e1 t\u1ed1t t\u1ef7 l\u1ec7 quay l\u1ea1i c\u1ee7a kh\u00e1ch h\u00e0ng.<\/span><\/p>\n<h3><strong>Theo b\u1ea1n, metric c\u00f3 th\u1ec3 b\u1ecb \u201cgian l\u1eadn\u201d ho\u1eb7c di\u1ec5n gi\u1ea3i sai l\u1ec7ch nh\u01b0 th\u1ebf n\u00e0o?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">M\u1ed9t v\u00ed d\u1ee5 l\u00e0 \u201cpageview\u201d \u2014 c\u00f3 th\u1ec3 b\u1ecb l\u00e0m gi\u1ea3 b\u1eb1ng bot ho\u1eb7c user refresh nhi\u1ec1u l\u1ea7n. <\/span><span style=\"font-weight: 400;\">T\u01b0\u01a1ng t\u1ef1, \u201cconversion rate\u201d c\u0169ng c\u00f3 th\u1ec3 t\u0103ng \u1ea3o n\u1ebfu ch\u00fang ta l\u1ecdc m\u1ea5t d\u1eef li\u1ec7u user truy c\u1eadp nh\u01b0ng kh\u00f4ng c\u00f3 \u00fd \u0111\u1ecbnh mua h\u00e0ng.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Vi\u1ec7c \u201cgian l\u1eadn\u201d n\u00e0y th\u01b0\u1eddng li\u00ean quan \u0111\u1ebfn c\u00e1ch \u0111\u1ecbnh ngh\u0129a, hay do incentive c\u1ee7a team (mu\u1ed1n KPI tr\u00f4ng \u0111\u1eb9p). T\u00f4i lu\u00f4n khuy\u1ebfn kh\u00edch doanh nghi\u1ec7p cross-check d\u1eef li\u1ec7u, xem logic \u0111o l\u01b0\u1eddng ch\u1eb7t ch\u1ebd kh\u00f4ng, v\u00e0 t\u1ea1o b\u1ed9 l\u1ecdc \u0111\u1ec3 lo\u1ea1i b\u1ecf h\u00e0nh vi \u1ea3o.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">D\u01b0\u1edbi \u0111\u00e2y l\u00e0 m\u1ed9t v\u00e0i <\/span>metric<span style=\"font-weight: 400;\"> c\u0169ng d\u1ec5 b\u1ecb \u201cl\u00e0m \u0111\u1eb9p\u201d ho\u1eb7c \u201cgian l\u1eadn\u201d t\u01b0\u01a1ng t\u1ef1 nh\u01b0 tr\u01b0\u1eddng h\u1ee3p \u201cpageview\u201d hay \u201cconversion rate\u201d:<\/span><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Click-through Rate (CTR)<\/b><span style=\"font-weight: 400;\">: <\/span><span style=\"font-weight: 400;\">D\u1ec5 b\u1ecb b\u00f3p m\u00e9o n\u1ebfu c\u00f3 bot ho\u1eb7c ng\u01b0\u1eddi d\u00f9ng b\u1ea5m \u0111i b\u1ea5m l\u1ea1i li\u00ean t\u1ee5c v\u00e0o link\/qu\u1ea3ng c\u00e1o.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>User Sign-ups \/ Registrations<\/b><span style=\"font-weight: 400;\">: <\/span><span style=\"font-weight: 400;\">C\u00f3 th\u1ec3 b\u1ecb \u201cph\u00f3ng \u0111\u1ea1i\u201d b\u1eb1ng c\u00e1ch t\u1ea1o nhi\u1ec1u t\u00e0i kho\u1ea3n \u1ea3o, email \u1ea3o, ho\u1eb7c ch\u01b0\u01a1ng tr\u00ecnh gi\u1edbi thi\u1ec7u (referral) t\u1ef1 spam.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>App Downloads<\/b><span style=\"font-weight: 400;\">: <\/span><span style=\"font-weight: 400;\">M\u1ed9t s\u1ed1 chi\u1ebfn d\u1ecbch qu\u1ea3ng c\u00e1o ho\u1eb7c bot c\u00f3 th\u1ec3 c\u1ed1 t\u00ecnh t\u1ea3i app nhi\u1ec1u l\u1ea7n \u0111\u1ec3 t\u0103ng s\u1ed1 l\u01b0\u1ee3t t\u1ea3i \u1ea3o.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Engagement tr\u00ean m\u1ea1ng x\u00e3 h\u1ed9i<\/b><span style=\"font-weight: 400;\"> (l\u01b0\u1ee3t like, share, comment): <\/span><span style=\"font-weight: 400;\">Nhi\u1ec1u tr\u01b0\u1eddng h\u1ee3p s\u1eed d\u1ee5ng tool auto-like, auto-follow, ho\u1eb7c thu\u00ea \u201cch\u1ee3 \u0111en\u201d \u0111\u1ec3 t\u0103ng t\u01b0\u01a1ng t\u00e1c.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Rating\/Review<\/b><span style=\"font-weight: 400;\">: <\/span><span style=\"font-weight: 400;\">C\u00e1c \u0111\u00e1nh gi\u00e1 sao (5 sao, 4 sao,\u2026) ho\u1eb7c review s\u1ea3n ph\u1ea9m\/d\u1ecbch v\u1ee5 c\u00f3 th\u1ec3 b\u1ecb thao t\u00fang (mua \u0111\u00e1nh gi\u00e1 \u1ea3o, x\u00f3a \u0111\u00e1nh gi\u00e1 x\u1ea5u,\u2026).<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><b>Email Open Rate<\/b><span style=\"font-weight: 400;\">: <\/span><span style=\"font-weight: 400;\">Bot ho\u1eb7c script t\u1ef1 \u0111\u1ed9ng m\u1edf mail (th\u01b0\u1eddng do b\u00ean g\u1eedi ki\u1ec3m so\u00e1t, nh\u1ea5t l\u00e0 n\u1ebfu m\u1ee5c ti\u00eau KPI b\u1ecb g\u1eafn v\u1edbi open rate).<\/span><\/li>\n<\/ul>\n<h3><strong>B\u1ea1n t\u1eebng x\u00e2y d\u1ef1ng pipeline ETL t\u1ef1 \u0111\u1ed9ng ch\u01b0a? D\u00f9ng c\u00f4ng c\u1ee5 g\u00ec (Airflow, Luigi\u2026)?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">C\u00f3, t\u00f4i d\u00f9ng Airflow cho m\u1ed9t d\u1ef1 \u00e1n li\u00ean quan \u0111\u1ebfn d\u1eef li\u1ec7u CRM &amp; Google Analytics.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i thi\u1ebft k\u1ebf DAG (Directed Acyclic Graph) \u0111\u1ec3 t\u1ef1 \u0111\u1ed9ng ho\u00e1 chu\u1ed7i task: t\u1ea3i d\u1eef li\u1ec7u th\u00f4, l\u00e0m s\u1ea1ch, transform sang schema chung, r\u1ed3i l\u01b0u v\u00e0o data warehouse (Redshift). Airflow cho ph\u00e9p t\u00f4i thi\u1ebft l\u1eadp l\u1ecbch ch\u1ea1y (cron) v\u00e0 theo d\u00f5i log d\u1ec5 d\u00e0ng. Nh\u1edd t\u1ef1 \u0111\u1ed9ng ho\u00e1, ch\u00fang t\u00f4i ti\u1ebft ki\u1ec7m trung b\u00ecnh 2-3 gi\u1edd th\u1ee7 c\u00f4ng m\u1ed7i ng\u00e0y.<\/span><\/p>\n<h3><strong>Data Modeling (Star schema, Snowflake schema) c\u00f3 vai tr\u00f2 g\u00ec trong ph\u00e2n t\u00edch?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">C\u00e1c schema n\u00e0y gi\u00fap t\u1ed5 ch\u1ee9c d\u1eef li\u1ec7u trong data warehouse g\u1ecdn g\u00e0ng, gi\u1ea3m b\u1edbt tr\u00f9ng l\u1eb7p, \u0111\u1ed3ng th\u1eddi t\u1ed1i \u01b0u t\u1ed1c \u0111\u1ed9 truy v\u1ea5n. <strong>Star schema<\/strong>, v\u1edbi fact table \u1edf trung t\u00e2m v\u00e0 dimension table xung quanh, r\u1ea5t tr\u1ef1c quan cho c\u00f4ng c\u1ee5 BI.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">C\u00f2n <strong>Snowflake schema<\/strong> \u201cchu\u1ea9n ho\u00e1\u201d th\u00eam dimension table, gi\u1ea3m tr\u00f9ng l\u1eb7p c\u1ed9t. T\u00f9y nhu c\u1ea7u (kh\u1ed1i l\u01b0\u1ee3ng d\u1eef li\u1ec7u, t\u1ea7n su\u1ea5t c\u1eadp nh\u1eadt, t\u00e0i nguy\u00ean h\u1ec7 th\u1ed1ng) m\u00e0 t\u00f4i ch\u1ecdn m\u00f4 h\u00ecnh ph\u00f9 h\u1ee3p.<\/span><\/p>\n<h3><strong>A\/B Testing: B\u1ea1n hi\u1ec3u v\u1ec1 hypothesis testing, p-value, significance level?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Khi A\/B Testing, t\u00f4i gi\u1ea3 \u0111\u1ecbnh H0: \u201ckh\u00f4ng c\u00f3 kh\u00e1c bi\u1ec7t gi\u1eefa phi\u00ean b\u1ea3n A v\u00e0 B\u201d (th\u01b0\u1eddng l\u00e0 version c\u0169 vs. version m\u1edbi).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">N\u1ebfu k\u1ebft qu\u1ea3 test cho ra p-value nh\u1ecf h\u01a1n significance level (v\u00ed d\u1ee5 0.05), t\u00f4i b\u00e1c b\u1ecf H0, ngh\u0129a l\u00e0 hai phi\u00ean b\u1ea3n c\u00f3 s\u1ef1 kh\u00e1c bi\u1ec7t \u0111\u00e1ng k\u1ec3.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ngo\u00e0i ra, t\u00f4i c\u0169ng xem x\u00e9t practical significance, v\u00ec p-value nh\u1ecf ch\u01b0a ch\u1eafc \u0111\u00e3 c\u00f3 \u00fd ngh\u0129a kinh doanh n\u1ebfu ch\u00eanh l\u1ec7ch hi\u1ec7u su\u1ea5t ch\u1ec9 0.1%.<\/span><\/p>\n<h3><strong>B\u1ea1n c\u00f3 kinh nghi\u1ec7m \u00e1p d\u1ee5ng regression ho\u1eb7c ph\u00e2n t\u00edch time series \u0111\u1ec3 d\u1ef1 b\u00e1o doanh s\u1ed1?<\/strong><\/h3>\n<h4><strong>Regression (h\u1ed3i quy) trong d\u1ef1 b\u00e1o<\/strong><\/h4>\n<p><span style=\"font-weight: 400;\">L\u00e0 ph\u01b0\u01a1ng ph\u00e1p s\u1eed d\u1ee5ng m\u1ed1i quan h\u1ec7 gi\u1eefa bi\u1ebfn ph\u1ee5 thu\u1ed9c (th\u01b0\u1eddng l\u00e0 ch\u1ec9 s\u1ed1 c\u1ea7n d\u1ef1 b\u00e1o) v\u00e0 m\u1ed9t ho\u1eb7c nhi\u1ec1u bi\u1ebfn \u0111\u1ed9c l\u1eadp (y\u1ebfu t\u1ed1 \u1ea3nh h\u01b0\u1edfng) \u0111\u1ec3 d\u1ef1 \u0111o\u00e1n gi\u00e1 tr\u1ecb t\u01b0\u01a1ng lai.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">V\u00ed d\u1ee5: D\u00f9ng doanh s\u1ed1 (bi\u1ebfn ph\u1ee5 thu\u1ed9c) v\u00e0 c\u00e1c bi\u1ebfn \u0111\u1ed9c l\u1eadp nh\u01b0 chi ph\u00ed qu\u1ea3ng c\u00e1o, xu h\u01b0\u1edbng th\u1ecb tr\u01b0\u1eddng&#8230; \u0111\u1ec3 \u01b0\u1edbc t\u00ednh doanh s\u1ed1 trong k\u1ef3 t\u1edbi.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">V\u1edbi regression, t\u00f4i th\u01b0\u1eddng th\u1eed Linear Regression, Random Forest ho\u1eb7c XGBoost tu\u1ef3 tr\u01b0\u1eddng h\u1ee3p. T\u00f4i c\u0169ng ki\u1ec3m tra c\u00e1c ch\u1ec9 s\u1ed1 nh\u01b0 R-squared, MAPE \u0111\u1ec3 \u0111\u00e1nh gi\u00e1 \u0111\u1ed9 ch\u00ednh x\u00e1c.<\/span><\/p>\n<h4><strong>Time series (chu\u1ed7i th\u1eddi gian) trong d\u1ef1 b\u00e1o<\/strong><\/h4>\n<p><span style=\"font-weight: 400;\">T\u1eadp trung v\u00e0o vi\u1ec7c d\u1ef1 \u0111o\u00e1n d\u1ef1a tr\u00ean ch\u00ednh d\u1eef li\u1ec7u qu\u00e1 kh\u1ee9 c\u1ee7a m\u1ed9t bi\u1ebfn theo th\u1eddi gian (ng\u00e0y, tu\u1ea7n, th\u00e1ng&#8230;).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">V\u00ed d\u1ee5: Ph\u00e2n t\u00edch doanh s\u1ed1 b\u00e1n h\u00e0ng qua c\u00e1c th\u00e1ng \u0111\u1ec3 t\u00ecm quy lu\u1eadt (xu h\u01b0\u1edbng, m\u00f9a v\u1ee5, chu k\u1ef3&#8230;) v\u00e0 d\u1ef1 b\u00e1o doanh s\u1ed1 th\u00e1ng ti\u1ebfp theo.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">V\u1ec1 time series, t\u00f4i c\u00f3 d\u00f9ng ARIMA, SARIMA ho\u1eb7c Prophet \u0111\u1ec3 x\u1eed l\u00fd d\u1eef li\u1ec7u theo m\u00f9a, v\u00ed d\u1ee5 chu k\u1ef3 mua s\u1eafm (holiday sale). M\u1ed9t l\u1ea7n, t\u00f4i \u00e1p d\u1ee5ng m\u00f4 h\u00ecnh Prophet cho d\u1eef li\u1ec7u 2 n\u0103m, sai s\u1ed1 MAPE dao \u0111\u1ed9ng kho\u1ea3ng 10%, cho ph\u00e9p c\u00f4ng ty d\u1ef1 tr\u00f9 h\u00e0ng t\u1ed3n kho kh\u00e1 s\u00e1t v\u1edbi nhu c\u1ea7u th\u1ef1c.<\/span><\/p>\n<h3><strong>B\u1ea1n t\u1eebng g\u1eb7p th\u00e1ch th\u1ee9c g\u00ec khi l\u00e0m vi\u1ec7c v\u1edbi d\u1eef li\u1ec7u c\u00f3 quy m\u00f4 r\u1ea5t l\u1edbn (<a href=\"https:\/\/itviec.com\/blog\/big-data-la-gi\/\" target=\"_blank\" rel=\"noopener\">Big Data<\/a>)?\u00a0<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Khi d\u1eef li\u1ec7u v\u01b0\u1ee3t m\u1ee9c v\u00e0i tr\u0103m tri\u1ec7u d\u00f2ng, t\u00f4i ph\u1ea3i c\u00e2n nh\u1eafc chia nh\u1ecf th\u00e0nh nhi\u1ec1u partition, ho\u1eb7c chuy\u1ec3n sang Spark\/Databricks \u0111\u1ec3 x\u1eed l\u00fd ph\u00e2n t\u00e1n.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">B\u1ed9 nh\u1edb RAM h\u1ea1n h\u1eb9p c\u0169ng bu\u1ed9c t\u00f4i thay \u0111\u1ed5i c\u00e1ch thao t\u00e1c, v\u00ed d\u1ee5 d\u00f9ng chunk, stream ho\u1eb7c k\u1ebft h\u1ee3p t\u00ednh to\u00e1n song song.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">M\u1eb7t kh\u00e1c, t\u00f4i ch\u00fa tr\u1ecdng c\u1ea5u tr\u00fac pipeline, tr\u00e1nh ch\u1ea1y l\u1ec7nh \u201cgroup by\u201d tr\u00ean to\u00e0n b\u1ed9 dataset m\u1ed9t c\u00e1ch thi\u1ebfu ki\u1ec3m so\u00e1t.<\/span><\/p>\n<h3><strong>B\u1ea1n th\u01b0\u1eddng ph\u00e2n t\u00edch d\u1eef li\u1ec7u ch\u1ea5t l\u01b0\u1ee3ng k\u00e9m (thi\u1ebfu chu\u1ea9n ho\u00e1, nhi\u1ec1u l\u1ed7i) nh\u01b0 th\u1ebf n\u00e0o \u0111\u1ec3 v\u1eabn \u0111\u1ea3m b\u1ea3o k\u1ebft qu\u1ea3 tin c\u1eady?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Tr\u01b0\u1edbc ti\u00ean, t\u00f4i ch\u1ea1y b\u01b0\u1edbc \u201cdata profiling\u201d \u0111\u1ec3 th\u1ed1ng k\u00ea ph\u1ea7n tr\u0103m gi\u00e1 tr\u1ecb thi\u1ebfu, ki\u1ec3m tra t\u00ednh nh\u1ea5t qu\u00e1n c\u1ee7a c\u1ed9t (\u0111\u1ecbnh d\u1ea1ng, m\u00e3 qu\u1ed1c gia\u2026), t\u00ecm duplicate. T\u00f4i t\u1ea1o b\u1ed9 quy t\u1eafc \u201cdata validation\u201d (v\u00ed d\u1ee5: c\u1ed9t email ph\u1ea3i ch\u1ee9a k\u00fd t\u1ef1 <\/span><span style=\"font-weight: 400;\">@<\/span><span style=\"font-weight: 400;\">) v\u00e0 ch\u1ea1y script \u0111\u1ec3 t\u1ef1 \u0111\u1ed9ng b\u00e1o l\u1ed7i.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">N\u1ebfu t\u00ecm th\u1ea5y ch\u00eanh l\u1ec7ch l\u1edbn, t\u00f4i b\u00e1o cho stakeholder ho\u1eb7c l\u1eadp tr\u00ecnh vi\u00ean ph\u1ee5 tr\u00e1ch thu th\u1eadp d\u1eef li\u1ec7u \u0111\u1ec3 s\u1eeda.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Trong tr\u01b0\u1eddng h\u1ee3p b\u1ea5t kh\u1ea3 kh\u00e1ng, t\u00f4i \u0111\u00e1nh d\u1ea5u r\u00f5 trong b\u00e1o c\u00e1o c\u00e1c d\u00f2ng d\u1eef li\u1ec7u ch\u01b0a \u0111\u1ea1t chu\u1ea9n, \u0111\u1ec3 kh\u00f4ng b\u1ecb l\u1eabn v\u00e0o k\u1ebft qu\u1ea3 cu\u1ed1i c\u00f9ng.<\/span><\/p>\n<h2><span class=\"ez-toc-section\" id=\"Cac_cau_hoi_phong_van_Data_Analyst_ve_ky_nang_mem_va_phan_tich_kinh_doanh\"><\/span><b>C\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst v\u1ec1 k\u1ef9 n\u0103ng m\u1ec1m v\u00e0 ph\u00e2n t\u00edch kinh doanh<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<h3><strong>Khi nh\u1eadn y\u00eau c\u1ea7u m\u01a1 h\u1ed3 t\u1eeb stakeholder, b\u1ea1n l\u00e0m th\u1ebf n\u00e0o \u0111\u1ec3 x\u00e1c \u0111\u1ecbnh m\u1ee5c ti\u00eau ph\u00e2n t\u00edch? V\u00ed d\u1ee5 h\u1ecd y\u00eau c\u1ea7u ph\u00e2n t\u00edch l\u01b0\u1ee3t \u0111\u0103ng k\u00fd m\u1edbi c\u1ee7a trang web?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Khi nh\u1eadn \u0111\u01b0\u1ee3c m\u1ed9t y\u00eau c\u1ea7u m\u01a1 h\u1ed3 nh\u01b0<\/span><span style=\"font-weight: 400;\"> \u201cH\u00e3y ph\u00e2n t\u00edch l\u01b0\u1ee3t \u0111\u0103ng k\u00fd m\u1edbi tr\u00ean trang web trong th\u1eddi gian v\u1eeba qua\u201d, <\/span><span style=\"font-weight: 400;\">t\u00f4i th\u01b0\u1eddng \u0111\u1eb7t nhi\u1ec1u c\u00e2u h\u1ecfi \u201cT\u1ea1i sao?\u201d \u0111\u1ec3 l\u00e0m r\u00f5 h\u1ecd mu\u1ed1n \u0111o l\u01b0\u1eddng c\u00e1i g\u00ec.<\/span><\/p>\n<blockquote><p><strong>M\u1ed9t y\u00eau c\u1ea7u m<\/strong><b>\u01a1 h\u1ed3<\/b><span style=\"font-weight: 400;\"> ngh\u0129a l\u00e0 stakeholder ch\u1ec9 n\u00f3i chung chung: \u201cPh\u00e2n t\u00edch l\u01b0\u1ee3t \u0111\u0103ng k\u00fd\u201d m\u00e0 kh\u00f4ng \u0111\u1ec1 c\u1eadp chi ti\u1ebft <\/span><b>m\u1ee5c ti\u00eau, ph\u1ea1m vi, th\u1eddi gian,<\/b><span style=\"font-weight: 400;\"> hay <\/span><b>ch\u1ec9 s\u1ed1 quan tr\u1ecdng<\/b><span style=\"font-weight: 400;\"> (KPI) c\u1ea7n theo d\u00f5i.<\/span><\/p><\/blockquote>\n<p><span style=\"font-weight: 400;\">D\u01b0\u1edbi \u0111\u00e2y l\u00e0 m\u1ed9t v\u00ed d\u1ee5 c\u00e1c c\u00e2u h\u1ecfi \u201cT\u1ea1i sao?\u201d t\u00f4i s\u1eed d\u1ee5ng \u0111\u1ec3 <\/span><b>l\u00e0m r\u00f5 y\u00eau c\u1ea7u<\/b><span style=\"font-weight: 400;\">:<\/span><\/p>\n<p><strong>1. \u201cT\u1ea1i sao b\u00ean anh\/ch\u1ecb mu\u1ed1n ph\u00e2n t\u00edch l\u01b0\u1ee3t \u0111\u0103ng k\u00fd m\u1edbi?\u201d<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">M\u1ee5c \u0111\u00edch: X\u00e1c \u0111\u1ecbnh <\/span><b>l\u1ee3i \u00edch th\u1ef1c s\u1ef1<\/b><span style=\"font-weight: 400;\"> h\u1ecd mu\u1ed1n \u0111\u1ea1t \u0111\u01b0\u1ee3c (VD: T\u1ed1i \u01b0u chi\u1ebfn d\u1ecbch marketing, c\u1ea3i thi\u1ec7n tr\u1ea3i nghi\u1ec7m ng\u01b0\u1eddi d\u00f9ng, t\u0103ng t\u1ef7 l\u1ec7 chuy\u1ec3n \u0111\u1ed5i,&#8230;).<\/span><\/p>\n<p><strong>2. \u201cT\u1ea1i sao vi\u1ec7c hi\u1ec3u r\u00f5 nguy\u00ean nh\u00e2n t\u0103ng\/gi\u1ea3m c\u1ee7a l\u01b0\u1ee3t \u0111\u0103ng k\u00fd m\u1edbi l\u1ea1i quan tr\u1ecdng l\u00fac n\u00e0y?\u201d<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">M\u1ee5c \u0111\u00edch: X\u00e1c \u0111\u1ecbnh <\/span><b>m\u1ee9c \u0111\u1ed9 \u01b0u ti\u00ean<\/b><span style=\"font-weight: 400;\">, b\u1ed1i c\u1ea3nh kinh doanh (v\u00ed d\u1ee5: S\u1eafp ra m\u1eaft chi\u1ebfn d\u1ecbch qu\u1ea3ng c\u00e1o, c\u1ea7n s\u1ed1 li\u1ec7u thuy\u1ebft ph\u1ee5c ban l\u00e3nh \u0111\u1ea1o,&#8230;).<\/span><\/p>\n<p><strong>3. &#8220;T\u1ea1i sao anh\/ch\u1ecb mu\u1ed1n ph\u00e2n t\u00edch b\u00e2y gi\u1edd, thay v\u00ec th\u1eddi \u0111i\u1ec3m kh\u00e1c?\u201d<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">M\u1ee5c \u0111\u00edch: L\u00e0m r\u00f5 <\/span><b>th\u1eddi gian v\u00e0 deadline<\/b><span style=\"font-weight: 400;\">; t\u1eeb \u0111\u00f3 x\u00e1c \u0111\u1ecbnh <\/span><b>t\u00e0i nguy\u00ean<\/b><span style=\"font-weight: 400;\"> v\u00e0 <\/span><b>k\u1ebf ho\u1ea1ch<\/b><span style=\"font-weight: 400;\"> ph\u00e2n t\u00edch.<\/span><\/p>\n<p><strong>4. \u201cNh\u1eefng h\u00e0nh \u0111\u1ed9ng ho\u1eb7c quy\u1ebft \u0111\u1ecbnh n\u00e0o s\u1ebd \u0111\u01b0\u1ee3c \u0111\u01b0a ra d\u1ef1a tr\u00ean k\u1ebft qu\u1ea3 ph\u00e2n t\u00edch n\u00e0y?\u201d<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">M\u1ee5c \u0111\u00edch: X\u00e1c \u0111\u1ecbnh <\/span><b>c\u00e1ch d\u00f9ng k\u1ebft qu\u1ea3<\/b><span style=\"font-weight: 400;\"> ph\u00e2n t\u00edch trong th\u1ef1c t\u1ebf, \u0111\u1ea3m b\u1ea3o ch\u00fang ta thu th\u1eadp <\/span><b>\u0111\u00fang lo\u1ea1i d\u1eef li\u1ec7u<\/b><span style=\"font-weight: 400;\"> v\u00e0 \u0111o <\/span><b>\u0111\u00fang KPI<\/b><span style=\"font-weight: 400;\">.<\/span><\/p>\n<p><strong>5. \u201cN\u1ebfu k\u1ebft qu\u1ea3 ph\u00e2n t\u00edch cho th\u1ea5y l\u01b0\u1ee3t \u0111\u0103ng k\u00fd t\u0103ng\/gi\u1ea3m do nguy\u00ean nh\u00e2n X, anh\/ch\u1ecb s\u1ebd c\u00f3 nh\u1eefng ph\u01b0\u01a1ng \u00e1n ra quy\u1ebft \u0111\u1ecbnh c\u1ee5 th\u1ec3 n\u00e0o?\u201d<\/strong><\/p>\n<p><span style=\"font-weight: 400;\">M\u1ee5c \u0111\u00edch: L\u00e0m r\u00f5 <\/span><b>ph\u01b0\u01a1ng \u00e1n h\u00e0nh \u0111\u1ed9ng<\/b><span style=\"font-weight: 400;\"> (action plan) sau khi c\u00f3 k\u1ebft qu\u1ea3.<\/span><\/p>\n<h3><strong>B\u1ea1n ti\u1ebfp c\u1eadn m\u1ed9t t\u00ecnh hu\u1ed1ng ph\u1ea3i gi\u1ea3i th\u00edch m\u1ed9t kh\u00e1i ni\u1ec7m k\u1ef9 thu\u1eadt cho ng\u01b0\u1eddi kh\u00f4ng chuy\u00ean nh\u01b0 th\u1ebf n\u00e0o?<\/strong><\/h3>\n<p>M\u1ed9t s\u1ed1 nguy\u00ean t\u1eafc t\u00f4i th\u01b0\u1eddng tu\u00e2n theo \u0111\u1ec3 gi\u1ea3i th\u00edch m\u1ed9t kh\u00e1i ni\u1ec7m k\u1ef9 thu\u1eadt cho ng\u01b0\u1eddi kh\u00f4ng chuy\u00ean d\u1ec5 hi\u1ec3u h\u01a1n:<\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">B\u1eaft \u0111\u1ea7u gi\u1ea3i th\u00edch b\u1eb1ng \u1ea9n d\u1ee5 th\u1ef1c t\u1ebf (v\u00ed d\u1ee5: gi\u1ea3i th\u00edch \u201cdatabase index\u201d nh\u01b0 m\u1ee5c l\u1ee5c s\u00e1ch, t\u00ecm trang nhanh h\u01a1n).<\/span><\/li>\n<li>\u01afu ti\u00ean d\u00f9ng bi\u1ec3u \u0111\u1ed3 ho\u1eb7c v\u00ed d\u1ee5 \u0111\u1ec3 minh ho\u1ea1. T\u00f4i c\u0169ng th\u01b0\u1eddng t\u1ea1o slide k\u00e8m bi\u1ec3u \u0111\u1ed3 \u0111\u01a1n gi\u1ea3n \u0111\u1ec3 tr\u00ecnh b\u00e0y tr\u1ef1c quan.<\/li>\n<li>H\u1ea1n ch\u1ebf vi\u1ebft t\u1eaft k\u1ef9 thu\u1eadt v\u00e0 ki\u1ec3m tra \u0111\u1ed1i ph\u01b0\u01a1ng c\u00f3 theo k\u1ecbp kh\u00f4ng.<\/li>\n<li>N\u1ebfu c\u1ea7n, chia kh\u00e1i ni\u1ec7m th\u00e0nh nhi\u1ec1u c\u1ea5p \u0111\u1ed9, t\u1eeb s\u01a1 khai (kh\u00f4ng d\u00f9ng t\u1eeb chuy\u00ean m\u00f4n) \u0111\u1ebfn trung c\u1ea5p (c\u00f3 m\u1ed9t v\u00e0i gi\u1ea3i th\u00edch code ng\u1eafn).<\/li>\n<\/ul>\n<h3><strong>KPI l\u00e0 g\u00ec? Cho v\u00ed d\u1ee5 v\u1ec1 KPI trong th\u01b0\u01a1ng m\u1ea1i \u0111i\u1ec7n t\u1eed ho\u1eb7c SaaS?<\/strong><\/h3>\n<p><b>G\u00f3c nh\u00ecn nh\u00e0 tuy\u1ec3n d\u1ee5ng:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Ng\u01b0\u1eddi ph\u1ecfng v\u1ea5n mu\u1ed1n ki\u1ec3m tra kh\u1ea3 n\u0103ng b\u1ea1n n\u1eafm v\u1eefng ki\u1ebfn tr\u00fac h\u1ea1 t\u1ea7ng d\u1eef li\u1ec7u (data architecture) v\u00e0 ph\u00e2n t\u00edch ph\u1ea7n m\u1ec1m (software analysis), nh\u1eb1m \u0111\u1ea3m b\u1ea3o b\u1ea1n c\u00f3 th\u1ec3 thi\u1ebft l\u1eadp, \u0111o l\u01b0\u1eddng, v\u00e0 t\u1ed1i \u01b0u KPI trong m\u00f4i tr\u01b0\u1eddng c\u00f4ng ngh\u1ec7 cao.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">H\u1ecd c\u0169ng mong b\u1ea1n hi\u1ec3u c\u00e1ch \u1ee9ng d\u1ee5ng v\u00e0 ph\u00e2n t\u00edch KPI kh\u00e1c nhau trong l\u0129nh v\u1ef1c nhi\u1ec1u c\u00f4ng ngh\u1ec7 (nh\u01b0 SaaS, th\u01b0\u01a1ng m\u1ea1i \u0111i\u1ec7n t\u1eed) so v\u1edbi c\u00e1c doanh nghi\u1ec7p \u00edt c\u00f4ng ngh\u1ec7 h\u01a1n (s\u1ea3n xu\u1ea5t ho\u1eb7c F&amp;B).<\/span><\/p>\n<p><b>C\u00e2u tr\u1ea3 l\u1eddi g\u1ee3i \u00fd:<\/b><\/p>\n<p><span style=\"font-weight: 400;\"><strong>KPI (Key Performance Indicator)<\/strong> l\u00e0 ch\u1ec9 s\u1ed1 ph\u1ea3n \u00e1nh hi\u1ec7u qu\u1ea3 ho\u1ea1t \u0111\u1ed9ng c\u1ed1t l\u00f5i.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">V\u00ed d\u1ee5, trong th\u01b0\u01a1ng m\u1ea1i \u0111i\u1ec7n t\u1eed, KPI c\u00f3 th\u1ec3 l\u00e0 \u201ct\u1ef7 l\u1ec7 chuy\u1ec3n \u0111\u1ed5i\u201d (conversion rate) ho\u1eb7c \u201cgi\u00e1 tr\u1ecb trung b\u00ecnh m\u1ed7i \u0111\u01a1n h\u00e0ng\u201d (average order value).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">V\u1edbi SaaS, c\u00f3 th\u1ec3 l\u00e0 \u201cMRR\u201d (Monthly Recurring Revenue) hay \u201cchurn rate\u201d (t\u1ef7 l\u1ec7 kh\u00e1ch h\u00e0ng r\u1eddi b\u1ecf). Nh\u1eefng KPI n\u00e0y gi\u00fap doanh nghi\u1ec7p \u0111o l\u01b0\u1eddng s\u1ee9c kh\u1ecfe kinh doanh m\u1ed9t c\u00e1ch tr\u1ef1c quan, t\u1eeb \u0111\u00f3 c\u00f3 \u0111\u1ecbnh h\u01b0\u1edbng c\u1ea3i thi\u1ec7n.<\/span><\/p>\n<h3><strong>B\u1ea1n t\u1eebng \u0111\u1ec1 xu\u1ea5t c\u1ea3i thi\u1ec7n chi\u1ebfn l\u01b0\u1ee3c marketing hay t\u1ed1i \u01b0u quy tr\u00ecnh b\u00e1n h\u00e0ng d\u1ef1a tr\u00ean d\u1eef li\u1ec7u nh\u01b0 th\u1ebf n\u00e0o?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Trong m\u1ed9t d\u1ef1 \u00e1n tr\u01b0\u1edbc, t\u00f4i ph\u00e2n t\u00edch d\u1eef li\u1ec7u kh\u00e1ch h\u00e0ng (CRM) v\u00e0 nh\u1eadn th\u1ea5y nh\u00f3m kh\u00e1ch c\u00f3 t\u1ef7 l\u1ec7 r\u1eddi b\u1ecf cao \u0111\u1ec1u c\u00f3 chung h\u00e0nh vi: \u00edt t\u01b0\u01a1ng t\u00e1c email, \u00edt \u0111\u0103ng nh\u1eadp.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u1eeb s\u1ed1 li\u1ec7u \u0111\u00f3, t\u00f4i \u0111\u1ec1 xu\u1ea5t chi\u1ebfn d\u1ecbch email remarketing nh\u1eafm ri\u00eang nh\u00f3m n\u00e0y, \u0111\u1ed3ng th\u1eddi t\u1ea1o \u01b0u \u0111\u00e3i \u0111\u1eb7c bi\u1ec7t khi h\u1ecd quay l\u1ea1i.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">K\u1ebft qu\u1ea3, t\u1ef7 l\u1ec7 ph\u1ea3n h\u1ed3i t\u0103ng ~20% v\u00e0 gi\u1ea3m ~5% churn trong th\u00e1ng ti\u1ebfp theo. Th\u00eam v\u00e0o \u0111\u00f3, t\u00f4i tr\u1ef1c quan ho\u00e1 k\u1ebft qu\u1ea3 \u0111\u1ec3 tr\u00ecnh b\u00e0y cho ph\u00f2ng Marketing, gi\u00fap h\u1ecd th\u1ea5y r\u00f5 \u201ctr\u01b0\u1edbc v\u00e0 sau\u201d d\u1ef1 \u00e1n.<\/span><\/p>\n<h3><strong>B\u1ea1n l\u00e0m sao \u0111\u1ec3 ph\u00e2n t\u00edch h\u00e0nh vi kh\u00e1ch h\u00e0ng, c\u1ea3i thi\u1ec7n t\u1ef7 l\u1ec7 chuy\u1ec3n \u0111\u1ed5i (conversion rate)?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">B\u01b0\u1edbc \u0111\u1ea7u ti\u00ean, t\u00f4i chia kh\u00e1ch h\u00e0ng th\u00e0nh c\u00e1c ph\u00e2n kh\u00fac (segmentation) d\u1ef1a tr\u00ean h\u00e0nh vi mua s\u1eafm (l\u1ecbch s\u1eed giao d\u1ecbch, l\u01b0\u1ee3t xem trang, \u0111\u1ed9 t\u01b0\u01a1ng t\u00e1c).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ti\u1ebfp theo, t\u00f4i thi\u1ebft l\u1eadp funnel (ph\u1ec5u) \u0111\u1ec3 xem \u1edf b\u01b0\u1edbc n\u00e0o kh\u00e1ch r\u01a1i r\u1ee5ng nhi\u1ec1u nh\u1ea5t (th\u00eam v\u00e0o gi\u1ecf h\u00e0ng nh\u01b0ng kh\u00f4ng thanh to\u00e1n, v.v.). D\u1eef li\u1ec7u n\u00e0y cho ph\u00e9p t\u00f4i ch\u1ea1y th\u1eed nghi\u1ec7m A\/B Testing \u1edf m\u1ed7i b\u01b0\u1edbc, v\u00ed d\u1ee5 thay \u0111\u1ed5i giao di\u1ec7n trang checkout, ho\u1eb7c g\u1ee3i \u00fd s\u1ea3n ph\u1ea9m li\u00ean quan.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Ph\u00e2n t\u00edch chi ti\u1ebft t\u1eebng bi\u1ebfn s\u1ed1, t\u00f4i ch\u1ecdn bi\u1ebfn n\u00e0o t\u00e1c \u0111\u1ed9ng m\u1ea1nh nh\u1ea5t \u0111\u1ebfn chuy\u1ec3n \u0111\u1ed5i \u0111\u1ec3 t\u1ed1i \u01b0u tr\u01b0\u1edbc.<\/span><\/p>\n<h3><strong>Khi g\u1eb7p v\u1ea5n \u0111\u1ec1 d\u1eef li\u1ec7u ph\u1ee9c t\u1ea1p, b\u1ea1n chia nh\u1ecf v\u00e0 x\u00e1c \u0111\u1ecbnh m\u1ee5c ti\u00eau th\u1ebf n\u00e0o?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">T\u00f4i th\u01b0\u1eddng m\u00f4 h\u00ecnh ho\u00e1 v\u1ea5n \u0111\u1ec1 theo t\u1eebng \u201cl\u1edbp\u201d:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">(1) X\u00e1c \u0111\u1ecbnh r\u00f5 m\u1ee5c \u0111\u00edch ph\u00e2n t\u00edch (gi\u1ea3i quy\u1ebft c\u00e2u h\u1ecfi g\u00ec?);<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(2) Xem ngu\u1ed3n d\u1eef li\u1ec7u, ch\u1ea5t l\u01b0\u1ee3ng d\u1eef li\u1ec7u;<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(3) Ch\u1ecdn ph\u01b0\u01a1ng ph\u00e1p ph\u00e2n t\u00edch (th\u1ed1ng k\u00ea m\u00f4 t\u1ea3, m\u00f4 h\u00ecnh d\u1ef1 \u0111o\u00e1n, v.v.);<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(4) Tri\u1ec3n khai ph\u00e2n t\u00edch v\u00e0 ki\u1ec3m \u0111\u1ecbnh k\u1ebft qu\u1ea3.<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Vi\u1ec7c chia nh\u1ecf gi\u00fap t\u00f4i ki\u1ec3m so\u00e1t r\u1ee7i ro, ph\u00e1t hi\u1ec7n s\u1edbm l\u1ed7i \u1edf t\u1eebng kh\u00e2u. T\u00f4i c\u0169ng l\u01b0u l\u1ea1i log v\u00e0 ghi ch\u00fa \u0111\u1ec3 quay v\u1ec1 khi c\u1ea7n.<\/span><\/p>\n<h3><strong>B\u1ea1n l\u00e0m th\u1ebf n\u00e0o \u0111\u1ec3 t\u00ecm ra root cause (nguy\u00ean nh\u00e2n g\u1ed1c r\u1ec5) khi d\u1eef li\u1ec7u \u201csai s\u1ed1\u201d ho\u1eb7c kh\u00f4ng kh\u1edbp?<\/strong><\/h3>\n<p><b>G\u00f3c nh\u00ecn nh\u00e0 tuy\u1ec3n d\u1ee5ng:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">Ng\u01b0\u1eddi ph\u1ecfng v\u1ea5n mu\u1ed1n \u0111\u1ea3m b\u1ea3o b\u1ea1n c\u00f3 \u0111\u1ee7 t\u01b0 duy v\u1ec1 h\u1ea1 t\u1ea7ng d\u1eef li\u1ec7u v\u00e0 kh\u1ea3 n\u0103ng ph\u00e2n t\u00edch ph\u1ea7n m\u1ec1m \u0111\u1ec3 theo d\u00f5i, \u0111\u1ed1i chi\u1ebfu, v\u00e0 x\u00e1c \u0111\u1ecbnh nguy\u00ean nh\u00e2n g\u1ed1c r\u1ec5 t\u1eeb nhi\u1ec1u ngu\u1ed3n (Google Analytics, CRM, v.v.).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">H\u1ecd c\u0169ng mu\u1ed1n th\u1ea5y b\u1ea1n hi\u1ec3u r\u00f5 quy tr\u00ecnh ETL\/pipeline, c\u0169ng nh\u01b0 c\u00e1ch t\u1ed5 ch\u1ee9c v\u00e0 chu\u1ea9n ho\u00e1 d\u1eef li\u1ec7u trong m\u00f4i tr\u01b0\u1eddng c\u00f4ng ngh\u1ec7 cao, kh\u00e1c v\u1edbi m\u00f4i tr\u01b0\u1eddng \u00edt c\u00f4ng ngh\u1ec7 h\u01a1n (nh\u01b0 s\u1ea3n xu\u1ea5t ho\u1eb7c F&amp;B).<\/span><\/p>\n<p><b>C\u00e2u tr\u1ea3 l\u1eddi g\u1ee3i \u00fd:<\/b><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i b\u1eaft \u0111\u1ea7u b\u1eb1ng vi\u1ec7c xem log thu th\u1eadp d\u1eef li\u1ec7u, so s\u00e1nh timestamp gi\u1eefa c\u00e1c h\u1ec7 th\u1ed1ng. Ki\u1ec3m tra xem c\u00f3 tr\u1ec5 khi \u0111\u1ed3ng b\u1ed9 (sync) d\u1eef li\u1ec7u kh\u00f4ng.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Sau \u0111\u00f3, t\u00f4i \u0111\u1ed1i chi\u1ebfu k\u1ebft qu\u1ea3 v\u1edbi ngu\u1ed3n g\u1ed1c (v\u00ed d\u1ee5: Google Analytics, Salesforce) \u0111\u1ec3 xem li\u1ec7u c\u00f3 s\u1ef1 kh\u00e1c bi\u1ec7t trong \u0111\u1ecbnh ngh\u0129a \u201cuser\u201d, \u201cclick\u201d hay kh\u00f4ng.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i c\u0169ng \u0111\u1eb7t c\u00e2u h\u1ecfi v\u1ec1 phi\u00ean b\u1ea3n code ETL, v\u00ec \u0111\u00f4i khi thay \u0111\u1ed5i logic \u1edf pipeline c\u0169 ch\u01b0a \u0111\u01b0\u1ee3c c\u1eadp nh\u1eadt v\u00e0o pipeline m\u1edbi.<\/span><\/p>\n<h3><strong>B\u1ea1n \u0111\u00e1nh gi\u00e1 \u0111\u1ed9 tin c\u1eady c\u1ee7a k\u1ebft qu\u1ea3 ph\u00e2n t\u00edch b\u1eb1ng c\u00e1ch n\u00e0o?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Tr\u01b0\u1edbc h\u1ebft, t\u00f4i d\u00f9ng th\u1ed1ng k\u00ea m\u00f4 t\u1ea3 (min, max, mean, median, count) \u0111\u1ec3 xem d\u1eef li\u1ec7u c\u00f3 h\u1ee3p l\u00fd kh\u00f4ng.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Sau \u0111\u00f3, n\u1ebfu l\u00e0 k\u1ebft qu\u1ea3 m\u00f4 h\u00ecnh d\u1ef1 \u0111o\u00e1n, t\u00f4i s\u1ebd chia d\u1eef li\u1ec7u train\/test \u0111\u1ec3 \u0111\u00e1nh gi\u00e1. C\u00f2n n\u1ebfu l\u00e0 ph\u00e2n t\u00edch A\/B Testing, t\u00f4i ki\u1ec3m \u0111\u1ecbnh p-value, significance level (th\u01b0\u1eddng l\u00e0 0.05).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i c\u0169ng c\u00f3 th\u1ec3 cross-check v\u1edbi m\u1ed9t ngu\u1ed3n d\u1eef li\u1ec7u kh\u00e1c (n\u1ebfu c\u00f3) \u0111\u1ec3 \u0111\u1ea3m b\u1ea3o k\u1ebft lu\u1eadn kh\u00f4ng b\u1ecb thi\u00ean l\u1ec7ch do m\u1ed9t ngu\u1ed3n duy nh\u1ea5t.<\/span><\/p>\n<h2><span class=\"ez-toc-section\" id=\"Cac_cau_hoi_phong_van_Data_Analyst_ve_Quan_ly_cong_viec_Quy_trinh_Truc_quan_hoa_du_lieu\"><\/span><b>C\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst v\u1ec1 Qu\u1ea3n l\u00fd c\u00f4ng vi\u1ec7c, Quy tr\u00ecnh &amp; Tr\u1ef1c quan ho\u00e1 d\u1eef li\u1ec7u<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<h3><strong>B\u1ea1n c\u00f3 kinh nghi\u1ec7m vi\u1ebft t\u00e0i li\u1ec7u m\u00f4 t\u1ea3 pipeline ph\u00e2n t\u00edch ch\u01b0a? N\u1ed9i dung ch\u00ednh l\u00e0 g\u00ec?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Th\u00f4ng th\u01b0\u1eddng, t\u00f4i vi\u1ebft t\u00e0i li\u1ec7u \u1edf d\u1ea1ng wiki n\u1ed9i b\u1ed9, bao g\u1ed3m c\u00e1c t\u00e0i li\u1ec7u m\u00f4 t\u1ea3:<\/span><\/p>\n<ul>\n<li><span style=\"font-weight: 400;\">(1) Ngu\u1ed3n d\u1eef li\u1ec7u (database, API, file CSV\u2026);<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(2) Quy tr\u00ecnh ETL (Extract, Transform, Load) v\u00e0 l\u1ecbch ch\u1ea1y;<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(3) Logic t\u00ednh c\u1ed9t m\u1edbi, KPI, metric;<\/span><\/li>\n<li><span style=\"font-weight: 400;\">(4) Bi\u1ec3u \u0111\u1ed3\/t\u00e0i li\u1ec7u h\u01b0\u1edbng d\u1eabn \u0111\u1ecdc report<\/span><\/li>\n<\/ul>\n<p><span style=\"font-weight: 400;\">Nh\u1eefng t\u00e0i li\u1ec7u n\u00e0y gi\u00fap team m\u1edbi v\u00e0o n\u1eafm b\u1eaft d\u1ec5, c\u0169ng h\u1ea1n ch\u1ebf sai s\u00f3t do hi\u1ec3u l\u1ea7m quy t\u1eafc t\u00ednh to\u00e1n.<\/span><\/p>\n<h3><strong>B\u1ea1n th\u01b0\u1eddng qu\u1ea3n l\u00fd c\u00f4ng vi\u1ec7c h\u1eb1ng ng\u00e0y tr\u00ean Jira\/Trello nh\u01b0 th\u1ebf n\u00e0o?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">Tr\u00ean Trello, t\u00f4i chia task th\u00e0nh c\u00e1c th\u1ebb (card) v\u1edbi ti\u00eau \u0111\u1ec1 r\u00f5 r\u00e0ng, v\u00ed d\u1ee5 \u201cX\u00e2y dashboard doanh thu tu\u1ea7n\u201d. M\u1ed7i th\u1ebb c\u00f3 m\u00f4 t\u1ea3, deadline, ai l\u00e0 ng\u01b0\u1eddi li\u00ean quan (watcher).<\/span><\/p>\n<p><span style=\"font-weight: 400;\">T\u00f4i th\u01b0\u1eddng xuy\u00ean c\u1eadp nh\u1eadt tr\u1ea1ng th\u00e1i (To Do -&gt; In Progress -&gt; Done). M\u1ed7i tu\u1ea7n, t\u00f4i review backlog \u0111\u1ec3 s\u1eafp x\u1ebfp \u0111\u1ed9 \u01b0u ti\u00ean.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">V\u1edbi Jira, t\u00f4i \u0111\u00f4i khi t\u1ea1o sub-task \u0111\u1ec3 t\u00e1ch ri\u00eang ph\u1ea7n ph\u00e2n t\u00edch code, ph\u1ea7n QA, ph\u1ea7n b\u00e1o c\u00e1o.<\/span><\/p>\n<h3><strong>B\u1ea1n t\u1eebng l\u00e0m vi\u1ec7c theo Agile\/Scrum? Vai tr\u00f2 c\u1ee7a DA trong team?<\/strong><\/h3>\n<p><span style=\"font-weight: 400;\">T\u00f4i t\u1eebng tham gia c\u00e1c sprint 2 tu\u1ea7n, trong \u0111\u00f3 Data Analyst (DA) th\u01b0\u1eddng \u0111\u1ea3m nh\u1eadn nhi\u1ec7m v\u1ee5 chu\u1ea9n b\u1ecb d\u1eef li\u1ec7u, ph\u00e2n t\u00edch insight \u0111\u1ec3 Product Owner c\u00f3 c\u01a1 s\u1edf quy\u1ebft \u0111\u1ecbnh t\u00ednh n\u0103ng m\u1edbi.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">M\u1ed7i ng\u00e0y, t\u00f4i tham gia daily stand-up, demo k\u1ebft qu\u1ea3 ph\u00e2n t\u00edch trong sprint review.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Khi team c\u1ea7n s\u1ed1 li\u1ec7u hay dashboard c\u1eadp nh\u1eadt, t\u00f4i s\u1ebd t\u1ea1o user story \u0111\u1ec3 b\u00e1m s\u00e1t, tr\u00e1nh qu\u00ean task.<\/span><\/p>\n<h2><span class=\"ez-toc-section\" id=\"Tong_ket\"><\/span><b>T\u1ed5ng k\u1ebft<\/b><span class=\"ez-toc-section-end\"><\/span><\/h2>\n<p><span style=\"font-weight: 400;\">C\u1ea7n l\u01b0u \u00fd r\u1eb1ng \u1edf m\u1ed7i doanh nghi\u1ec7p, v\u1ecb tr\u00ed Data Analyst c\u00f3 th\u1ec3 \u201cbi\u1ebfn ho\u00e1\u201d tu\u1ef3 theo t\u00ednh ch\u1ea5t c\u00f4ng vi\u1ec7c v\u00e0 m\u1ee5c ti\u00eau kinh doanh. \u0110\u00f4i khi c\u00f4ng ty c\u00f2n \u0111\u00f2i h\u1ecfi b\u1ea1n ph\u1ea3i ki\u00eam lu\u00f4n m\u1ed9t ph\u1ea7n Data Engineering (DE) ho\u1eb7c Data Science (DS) \u0111\u1ec3 ph\u1ee5c v\u1ee5 cho m\u1ed9t d\u1ef1 \u00e1n c\u1ee5 th\u1ec3. V\u00ec v\u1eady, \u1ee9ng vi\u00ean n\u00ean chu\u1ea9n b\u1ecb k\u1ef9 \u0111\u1ec3 qu\u1ea3n l\u00fd t\u1ed1t k\u1ef3 v\u1ecdng c\u1ee7a c\u1ea3 hai b\u00ean trong bu\u1ed5i ph\u1ecfng v\u1ea5n.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">M\u1ed9t bu\u1ed5i ph\u1ecfng v\u1ea5n t\u1ed1t kh\u00f4ng ch\u1ec9 \u0111\u1ebfn t\u1eeb vi\u1ec7c \u201c\u1ee9ng vi\u00ean gi\u1ecfi h\u1ecfi-\u0111\u00e1p\u201d m\u00e0 c\u00f2n t\u1eeb s\u1ef1 chu\u1ea9n b\u1ecb k\u1ef9 l\u01b0\u1ee1ng c\u1ea3 hai ph\u00eda. Hy v\u1ecdng 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst tr\u00ean s\u1ebd gi\u00fap b\u1ea1n b\u01b0\u1edbc v\u00e0o bu\u1ed5i ph\u1ecfng v\u1ea5n Data Analyst v\u1edbi s\u1ef1 t\u1ef1 tin, hi\u1ec3u bi\u1ebft v\u00e0 t\u1ea7m nh\u00ecn r\u1ed9ng h\u01a1n v\u1ec1 gi\u00e1 tr\u1ecb m\u00e0 Data Analyst mang l\u1ea1i cho doanh nghi\u1ec7p. Ch\u00fac b\u1ea1n th\u00e0nh c\u00f4ng!<\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Theo b\u00e1o c\u00e1o \u201cJobs on the Rise\u201d c\u1ee7a LinkedIn, Data Analyst n\u1eb1m trong Top 10 c\u00f4ng vi\u1ec7c t\u0103ng tr\u01b0\u1edfng m\u1ea1nh nh\u1ea5t to\u00e0n c\u1ea7u v\u1edbi m\u1ee9c t\u0103ng 25% m\u1ed7i n\u0103m \u1edf nhi\u1ec1u th\u1ecb tr\u01b0\u1eddng ph\u00e1t tri\u1ec3n. Th\u00eam v\u00e0o \u0111\u00f3, th\u1ebf gi\u1edbi c\u00f3 th\u1ec3 t\u1ea1o ra t\u1edbi 463 exabyte d\u1eef li\u1ec7u m\u1ed7i ng\u00e0y v\u00e0o n\u0103m 2025 \u2013 [&hellip;]<\/p>\n","protected":false},"author":222,"featured_media":84174,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_gspb_post_css":"","footnotes":""},"categories":[109,105],"tags":[],"class_list":["post-84096","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-chuyen-mon-it","category-phong-van-it"],"blocksy_meta":[],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v26.8 (Yoast SEO v27.8) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p - ITviec Blog<\/title>\n<meta name=\"description\" content=\"T\u1ed5ng h\u1ee3p c\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p t\u1eeb l\u00e0m s\u1ea1ch d\u1eef li\u1ec7u, t\u01b0 duy ph\u00e2n t\u00edch kinh doanh, \u0111\u1ebfn thi\u1ebft k\u1ebf v\u00e0 \u0111o l\u01b0\u1eddng ch\u1ec9 s\u1ed1.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/\" \/>\n<meta property=\"og:locale\" content=\"vi_VN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p\" \/>\n<meta property=\"og:description\" content=\"Theo b\u00e1o c\u00e1o \u201cJobs on the Rise\u201d c\u1ee7a LinkedIn, Data Analyst n\u1eb1m trong Top 10 c\u00f4ng vi\u1ec7c t\u0103ng tr\u01b0\u1edfng m\u1ea1nh nh\u1ea5t to\u00e0n c\u1ea7u v\u1edbi m\u1ee9c t\u0103ng 25% m\u1ed7i n\u0103m \u1edf nhi\u1ec1u th\u1ecb\" \/>\n<meta property=\"og:url\" content=\"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/\" \/>\n<meta property=\"og:site_name\" content=\"ITviec Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/ITviec\" \/>\n<meta property=\"article:published_time\" content=\"2025-01-17T10:51:22+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2026-04-06T03:20:39+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2025\/01\/cau-hoi-phong-van-Data-Analyst-vippro.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1500\" \/>\n\t<meta property=\"og:image:height\" content=\"790\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Nguy\u1ec5n H\u1eefu V\u0103n\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@ITviec\" \/>\n<meta name=\"twitter:site\" content=\"@ITviec\" \/>\n<meta name=\"twitter:label1\" content=\"\u0110\u01b0\u1ee3c vi\u1ebft b\u1edfi\" \/>\n\t<meta name=\"twitter:data1\" content=\"Nguy\u1ec5n H\u1eefu V\u0103n\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u01af\u1edbc t\u00ednh th\u1eddi gian \u0111\u1ecdc\" \/>\n\t<meta name=\"twitter:data2\" content=\"34 ph\u00fat\" \/>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p - ITviec Blog","description":"T\u1ed5ng h\u1ee3p c\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p t\u1eeb l\u00e0m s\u1ea1ch d\u1eef li\u1ec7u, t\u01b0 duy ph\u00e2n t\u00edch kinh doanh, \u0111\u1ebfn thi\u1ebft k\u1ebf v\u00e0 \u0111o l\u01b0\u1eddng ch\u1ec9 s\u1ed1.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/","og_locale":"vi_VN","og_type":"article","og_title":"Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p","og_description":"Theo b\u00e1o c\u00e1o \u201cJobs on the Rise\u201d c\u1ee7a LinkedIn, Data Analyst n\u1eb1m trong Top 10 c\u00f4ng vi\u1ec7c t\u0103ng tr\u01b0\u1edfng m\u1ea1nh nh\u1ea5t to\u00e0n c\u1ea7u v\u1edbi m\u1ee9c t\u0103ng 25% m\u1ed7i n\u0103m \u1edf nhi\u1ec1u th\u1ecb","og_url":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/","og_site_name":"ITviec Blog","article_publisher":"https:\/\/www.facebook.com\/ITviec","article_published_time":"2025-01-17T10:51:22+00:00","article_modified_time":"2026-04-06T03:20:39+00:00","og_image":[{"width":1500,"height":790,"url":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2025\/01\/cau-hoi-phong-van-Data-Analyst-vippro.jpg","type":"image\/jpeg"}],"author":"Nguy\u1ec5n H\u1eefu V\u0103n","twitter_card":"summary_large_image","twitter_creator":"@ITviec","twitter_site":"@ITviec","twitter_misc":{"\u0110\u01b0\u1ee3c vi\u1ebft b\u1edfi":"Nguy\u1ec5n H\u1eefu V\u0103n","\u01af\u1edbc t\u00ednh th\u1eddi gian \u0111\u1ecdc":"34 ph\u00fat"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#article","isPartOf":{"@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/"},"author":{"name":"Nguy\u1ec5n H\u1eefu V\u0103n","@id":"https:\/\/itviec.com\/blog\/#\/schema\/person\/a77cc13f89eaa58f59d8772448febe5f"},"headline":"Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p","datePublished":"2025-01-17T10:51:22+00:00","dateModified":"2026-04-06T03:20:39+00:00","mainEntityOfPage":{"@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/"},"wordCount":9277,"publisher":{"@id":"https:\/\/itviec.com\/blog\/#organization"},"image":{"@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#primaryimage"},"thumbnailUrl":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2025\/01\/cau-hoi-phong-van-Data-Analyst-vippro.jpg","articleSection":["Chuy\u00ean m\u00f4n IT","Ph\u1ecfng v\u1ea5n IT"],"inLanguage":"vi"},{"@type":"WebPage","@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/","url":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/","name":"Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p - ITviec Blog","isPartOf":{"@id":"https:\/\/itviec.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#primaryimage"},"image":{"@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#primaryimage"},"thumbnailUrl":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2025\/01\/cau-hoi-phong-van-Data-Analyst-vippro.jpg","datePublished":"2025-01-17T10:51:22+00:00","dateModified":"2026-04-06T03:20:39+00:00","description":"T\u1ed5ng h\u1ee3p c\u00e1c c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p t\u1eeb l\u00e0m s\u1ea1ch d\u1eef li\u1ec7u, t\u01b0 duy ph\u00e2n t\u00edch kinh doanh, \u0111\u1ebfn thi\u1ebft k\u1ebf v\u00e0 \u0111o l\u01b0\u1eddng ch\u1ec9 s\u1ed1.","breadcrumb":{"@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#breadcrumb"},"inLanguage":"vi","potentialAction":[{"@type":"ReadAction","target":["https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/"]}]},{"@type":"ImageObject","inLanguage":"vi","@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#primaryimage","url":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2025\/01\/cau-hoi-phong-van-Data-Analyst-vippro.jpg","contentUrl":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2025\/01\/cau-hoi-phong-van-Data-Analyst-vippro.jpg","width":1500,"height":790,"caption":"ca\u0302u ho\u0309i pho\u0309ng va\u0302\u0301n Data Analyst - itviec blog"},{"@type":"BreadcrumbList","@id":"https:\/\/itviec.com\/blog\/cau-hoi-phong-van-data-analyst\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Chuy\u00ean m\u00f4n IT","item":"https:\/\/itviec.com\/blog\/chuyen-mon-it\/"},{"@type":"ListItem","position":2,"name":"Top 30+ c\u00e2u h\u1ecfi ph\u1ecfng v\u1ea5n Data Analyst th\u01b0\u1eddng g\u1eb7p"}]},{"@type":"WebSite","@id":"https:\/\/itviec.com\/blog\/#website","url":"https:\/\/itviec.com\/blog\/","name":"ITviec Blog","description":"IT Jobs &amp; People in Vietnam","publisher":{"@id":"https:\/\/itviec.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/itviec.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"vi"},{"@type":"Organization","@id":"https:\/\/itviec.com\/blog\/#organization","name":"ITviec","url":"https:\/\/itviec.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"vi","@id":"https:\/\/itviec.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2018\/12\/itviec-black-square-facebook.png","contentUrl":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2018\/12\/itviec-black-square-facebook.png","width":1800,"height":1800,"caption":"ITviec"},"image":{"@id":"https:\/\/itviec.com\/blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/ITviec","https:\/\/x.com\/ITviec","https:\/\/www.linkedin.com\/company\/itviec","https:\/\/www.youtube.com\/channel\/UCYthAQ3bcGr57M_ag5gHDvQ"]},{"@type":"Person","@id":"https:\/\/itviec.com\/blog\/#\/schema\/person\/a77cc13f89eaa58f59d8772448febe5f","name":"Nguy\u1ec5n H\u1eefu V\u0103n","image":{"@type":"ImageObject","inLanguage":"vi","@id":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2024\/03\/TR-Nguyen-Huu-Van-vippro-e1712136004193-100x100.jpg","url":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2024\/03\/TR-Nguyen-Huu-Van-vippro-e1712136004193-100x100.jpg","contentUrl":"https:\/\/itviec.com\/blog\/wp-content\/uploads\/2024\/03\/TR-Nguyen-Huu-Van-vippro-e1712136004193-100x100.jpg","caption":"Nguy\u1ec5n H\u1eefu V\u0103n"},"url":"https:\/\/itviec.com\/blog\/author\/nguyen-huu-van-2\/"}]}},"_links":{"self":[{"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/posts\/84096","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/users\/222"}],"replies":[{"embeddable":true,"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/comments?post=84096"}],"version-history":[{"count":1,"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/posts\/84096\/revisions"}],"predecessor-version":[{"id":95448,"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/posts\/84096\/revisions\/95448"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/media\/84174"}],"wp:attachment":[{"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/media?parent=84096"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/categories?post=84096"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/itviec.com\/blog\/wp-json\/wp\/v2\/tags?post=84096"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}