Cach Thay Doi Giong Noi Cua Ban Qua Bat Ki Microphone Nao: Huong Dan Toan Van

Tim hieu cach thay doi giong noi cua ban qua bat ki microphone nao, pitch, formant, resonance duoc giai thich, chuoi tin hieu WASAPI, va huong dan tung buoc cho Discord, Zoom, OBS, va trong tro choi.

Cach Thay Doi Giong Noi Cua Ban Qua Bat Ki Microphone Nao: Huong Dan Toan Van

Thay doi giong noi cua ban qua microphone don gian hon nhung huong dan khong lam cho dung day, nhung chi khi ban hieu phan mem thuc te dang lam gi. Huong dan nay bao gom cac kien thuc co ban am thanh (pitch, formant, resonance), chuoi tin hieu am thanh Windows, va cau hinh tung buoc cho Discord, Zoom, OBS, va obrolan suara trong tro choi.


TL;DR

  • Thay doi giong noi hoat dong bang cach chiem duoc tin hieu microphone cua ban trong phan mem, truoc khi bat ky ung dung nao thay dieu do
  • Chi dich chuyen pitch nghe giong nhu robot, ket hop voi dich chuyen formant cho ket qua tu nhien
  • WASAPI la API am thanh cap thap Windows cho phep latency xu ly duoi 20 ms
  • Dau ra dinh tuyen toi mot microphone ao ma cac ung dung cua ban lua chon thay vi tru thuc
  • Thiet lap la cung mo hinh cho moi ung dung: lua chon microphone ao lam input
  • VoxBooster xu ly WASAPI, AI voice cloning, va dinh tuyen ao trong mot cai dat, duoi 300 ms end-to-end tren bat ky may Windows 10/11

1. Dieu Thuc Tay Xay Ra Khi Ban “Thay Doi Giong Noi Cua Ban”

Giong noi cua ban la mot tin hieu am thanh phuc tap. Ba dac tinh xac dinh cach do nghe:

Pitch (F0, tan so co ban) Pitch la toc do ma cac day thanh cua ban rung. Cac nam thanh nien trung binh khoang 85-180 Hz; cac nu thanh nien trung binh khoang 165-255 Hz. Nang pitch theo mot octave nhan doi F0; ha no chia F0.

Formants Formants la cac dinh resonant duoc tao boi ong xa tieng cua ban (hong, mieng, khoang mui) tao hinh ten dau tien tu day thanh cua ban. F1 va F2 la nhung dieu quan trong nhat ve mat nhan thuc, chung xac dinh am thanh vokal va cho tieng noi dac tinh cua no. Giong baritone va tenor hat cung mot not tai cung mot pitch van nghe khac boi vi formants cua ho khac nhau.

Spectral envelope Phan bo nang luong tong the tren tat ca cac tan so, dieu lam cho tieng noi nghe “am ap”, “mui”, “tho”, hoac “sac.”

Mot may dich chuyen pitch co ban di F0 ma khong cam formants. Day la ly do tai sao cac phan mem thay doi giong noi re tien nghe giong nhu con so hoac quai vat gua, co ban di nhung resonance nam o vi tri sai. Phan mem thay doi giong noi level chuyenp thiet lap nhan doi pitch va formants doc lap va dieu chinh spectral envelope de phu hop voi hille suara dich. Su ket hop do la dieu tao nen mot giong noi khac biet da tin cay thay vi mot giong noi ro rang duoc xu ly.


2. Chuoi Tin Hieu WASAPI Tren Windows

Hieu duong dan tin hieu giup ban cau hinh tat ca dung cach va tan doan van de.

Microphone vat ly

Trai xuat am thanh Windows (WASAPI)

Phan mem thay doi giong noi (capture loop)
     → may dich chuyen pitch
     → may dich chuyen formant
     → chuoi hieu ung (EQ, reverb, noise gate)

Thiet bi am thanh ao (microphone ao)

Ung dung dich (Discord / Zoom / OBS / tro choi)

Tai sao WASAPI quan trong

Windows co hai giao dien am thanh chinh: DirectSound (cu, tren cao) va WASAPI (Windows Audio Session API, gioi thieu o Vista). WASAPI co the chay trong hai che do:

  • Che do chung, dong co am thanh Windows hon hop nhieu dong. Them mot buffer hon hop (thuong 10-20 ms) nhung cho phep cac ung dung khac su dung thiet bi do dong thoi.
  • Che do exclusive, ung dung chiem quyen so huu truc tiep cua giao dien phan cung. Khong co latency mixer, nhung khong co ung dung khac co the su dung thiet bi do dong thoi.

Cac phan mem thay doi giong noi thuong chay che do chung WASAPI o phia capture (doc cac mau microphone cua ban) va tao mot thiet bi WDM/MME ao cho dau ra, microphone ao. Dieu nay cho phep Discord, Zoom, va cac ung dung khac lay no qua khai bao am thanh Windows binh thuong.

Phan tich latency tong (desktop dien hinh, phan cung 2024)

Giai DoanLatency Dien Hinh
Analog microphone > digital (ADC)1-3 ms
Buffer capture WASAPI5-10 ms
Xu ly (pitch + formant)10-30 ms
Buffer dau ra thiet bi ao5-10 ms
Nhan ung dung1-5 ms
Tong Cong~22-58 ms

Duoi 50 ms khong con thay trong obrolan suara. Duoi 100 ms la chap nhan duoc. Phan mem yeu cau driver che do nhan hoac buffer DSP lon co the day dieu nay tren 150 ms, co tro nen de thay trong cuoc tro chuyen.


3. Chon Phan Mem Thay Doi Giong Noi Chinh Xac

Truoc khi vao chi tiet setup cho moi ung dung, hay chon phan mem phu hop voi truong hop su dung cua ban:

Cho su dung casual / streaming / gaming: Mot phan mem thay doi giong noi real-time voi thu vien preset va dau ra microphone ao. Tim kiem ho tro WASAPI va dich chuyen formant, khong chi pitch.

Cho noi dung chuyenp / tieng noi doc dao: AI voice cloning, anh xa phat am cua ban vao mot mo hinh tieng noi duoc huan luyen o thoi gian thuc. Latency cao hon mot chut (duoi 300 ms voi cac may hien dai) nhung ket qua khong phan biet duoc voi tieng noi da ghi.

Cho latency thap nhat tuyet doi: WASAPI exclusive mode nguyen ban + kich thuoc buffer nho (128 mau o 48 kHz = 2,67 ms moi lan pass buffer). Chi quan trong cho chi pal live hoac su dung san khau, khong can cho Discord hoac gaming.

Cac tinh nang chu de de kiem tra truoc khi cai dat:

  • Tao mot microphone ao xuat hien trong cai dat Am thanh Windows
  • Khong can driver kernel (driver kernel co the xung dot voi phan mem anti-cheat trong tro choi)
  • Chay tren Windows 10 va Windows 11 ma khong can cai dat Visual C++ them
  • Ho tro capture WASAPI (khong chi polling WDM/MME)

VoxBooster cai dat mot thiet bi am thanh ao WDM da ky va xu ly qua WASAPI, ma khong co driver che do nhan. No hoat dong tren Windows 10 va Windows 11 va them AI voice cloning tren dau cac hieu ung pitch/formant chuan.


4. Tung Buoc: Thiet Lap Cho Discord

Discord la truong hop su dung pho bien nhat va de cau hinh nhat.

Buoc 1, Cai dat va phat dong phan mem thay doi giong noi cua ban

Chay installer va phat dong phan mem. Xac nhan rang no xuat hien trong system tray Windows va am thanh chay (meter input phai phan ung khi ban noi).

Buoc 2, Xac minh microphone ao tren Windows

Mo Settings > System > Sound > More sound settings (hoac nhan phai bieu tuong speaker tray > Sounds > tab Recording). Ban se thay mot thiet bi ghi am moi, thuong co ten giong nhu “VoxBooster Virtual Microphone” hoac tuong tu. Neu no xuat hien “Not plugged in,” hay khoi dong lai dich vu thay doi giong noi.

Buoc 3, Tat microphone vat ly cua ban trong mixer Windows

Nhan phai microphone vat ly cua ban trong tab Recording > Disable. Dieu nay nggan Discord tu khi cung ghi am thanh tho thuc tu microphone thuc cua ban dong thoi. Ban co the kich hoat lai khi ket thuc.

Buoc 4, Cau hinh Discord

Mo User Settings > Voice & Video. O duoi Input Device, lua chon microphone ao tu dropdown. Dat Input Mode thanh Voice Activity va dieu chinh slider nhay cam den khi Discord chi kich hoat khi ban noi.

Buoc 5, Kiem Tra

Su dung Let’s Check echo test trong cai dat Voice & Video cua Discord, hoac tham gia mot server tu nhan voi ban be. Xac nhan ho nghe giong noi duoc xu ly, khong phai ban dau.

Tan doan am thanh lac Discord: Neu nhung nguoi khac nghe ban hai lan, microphone vat ly cua ban van dang kich hoat tren Windows, hay kiem tra lai Buoc 3.


5. Tung Buoc: Thiet Lap Cho Zoom

Zoom them mot cap xu ly am thanh cua no (sap xep nhiet do tu dong, huy cam hieu ung) co the can tro dau ra phan mem thay doi giong noi.

Buoc 1, Hoan thanh cac Buoc 1-3 tu phan Discord o tren (cai dat, xac minh microphone ao, tat microphone vat ly tren Windows).

Buoc 2, Cau hinh Zoom

Mo Settings > Audio. O duoi Microphone, lua chon microphone ao. Nhan Test Mic de xac nhan muc duoc ghi.

Buoc 3, Tat xu ly am thanh Zoom

Dieu nay quan trong: mo Settings > Audio > Advanced va dat:

  • Suppress background noise > Thap (hoac Tat)
  • Suppress intermittent noise > Tat
  • Echo cancellation > Tu dong

Sap xep nhiet do anh huong Zoom coi cac hieu ung phan mem thay doi giong noi la “nhiet do” va loc chung ra, lam giam hieu ung. Dat sap xep thanh Thap hoac Tat cho phep am thanh duoc xu ly qua sach se.

Buoc 4, Kiem Tra

Su dung Test Speaker & Microphone trong cai dat Audio Zoom, hoac bat dau mot cuoc hop kiem tra. Xac minh giong noi da bien duoi nghe sach se ma khong co hieu ung.


6. Tung Buoc: Thiet Lap Cho OBS

OBS (Open Broadcaster Software) duoc su dung de streaming va ghi. No xu ly cac nguon am thanh khac voi cac ung dung giao tiep, no ghi am thanh lam mot nguon thay vi lua chon mot thiet bi input toan he thong.

Buoc 1, Cai dat phan mem thay doi giong noi va xac minh microphone ao (Buoc 1-2 tu phan Discord).

Buoc 2, Them microphone ao la mot nguon Audio Input Capture trong OBS

Trong OBS, mo Sources > Add > Audio Input Capture. Dat ten (v.d., “Voice Changer”). Trong dropdown thiet bi, lua chon microphone ao.

Buoc 3, Xoa hoac cam tieng nguon microphone vat ly cua ban

Neu ban truoc day co mot nguon microphone trong OBS chi tro toi microphone thuc cua ban, hay cam tieng hoac xoa de tranh nhan doi.

Buoc 4, Them bo loc Noise Gate (tuy chon nhung duoc khuyen nghi)

Nhan phai nguon Audio Input Capture > Filters > Add > Noise Gate. Dat gioi han dong khoang -50 dB va gioi han mo khoang -40 dB. Dieu nay nggan bat ky hieu ung xu ly nao trong im lang xuat hien trong ban ghi.

Buoc 5, Giam sat trong OBS

Nhan phai nguon am thanh > Advanced Audio Settings > kich hoat Monitor and Output de nghe giong noi duoc xu ly qua tai nghe cua ban o thoi gian thuc khi ghi hoac streaming.


7. Tung Buoc: Obrolan Suara Trong Tro Choi

Hau het cac tro choi (Valorant, Fortnite, Counter-Strike, v.v.) su dung thiet bi giao tiep mac dinh Windows hoac cho phep ban chon mot thiet bi input trong cai dat am thanh cua tro choi.

Tuy Chon A, Dat lam thiet bi giao tiep mac dinh

Trong Windows Sound > tab Recording, nhan phai microphone ao > Set as Default Communication Device. Cac tro choi tu dong lua chon thiet bi giao tiep se su dung no.

Tuy Chon B, Dat trong tro choi

Mo am thanh hoac cai dat giong noi cua tro choi. Tim dropdown microphone/voice input va lua chon microphone ao theo ten. Dieu nay ghi de mau dinh Windows cho tro choi do cu the.

Xem xet anti-cheat

Mot so he thong anti-cheat (Vanguard, EAC) giam sat driver che do nhan. Mot phan mem thay doi giong noi cai dat o ring-0 (driver nhan) co the gat co anti-cheat. Phan mem chay lam mot ung dung khong gian nguoi dung voi mot thiet bi am thanh ao WDM da ky, ma khong co driver kernel, tranh van de nay hoan toan.

Latency trong tro choi

Obrolan suara trong tro choi them latency mang rieng tren tren latency phan mem thay doi giong noi dia phuong. Phan xu ly dia phuong (microphone > microphone ao) phai giu duoi 50 ms; phan mang ngoai tam soat cua ban. Tong did tan duoc cam thuc phu thuoc vao server ping, khong chinh xac la tren phan mem thay doi giong noi.


8. Dinh Chon Giong Noi: Pitch, Formant, va Hieu Ung

Sau khi dinh tuyen hoat dong, chat luong cua chuyen doi phu thuoc vao cach ban dieu chinh cac tham so.

Dich chuyen pitch

Hau het cac giong noi tu nhien nam trong ±12 semitone (mot octave) cua pitch ban dau cua ho. Vuot qua dieu do, hieu ung tro nen de thay. Doi voi mot dich chuyen nam > nu thuat thuc, hay thu +5 toi +8 semitone. Doi voi nu > nam, hay thu -4 toi -6 semitone.

Dich chuyen formant

Dich chuyen formant di chuyen cac resonance cua ong xa tieng cua ban doc lap voi pitch. Nang formant de nghe tre hon/nho hon; ha cho de nghe lon hon/sau hon. Mot diem bat dau tot cho mot giong noi da duoc dich chuyen pitch la nang formant +1 toi +2 semitone de khop.

Noise gate

Dat noise gate de dong o -55 dB de nggan thuat toan xu ly nhiet do xung quanh hoac am thanh tho. Dieu nay giu dau ra sach se trong im lang.

Reverb va EQ

Reverb phong vua phuc (han rut 0,3-0,5 s) co the che giau hieu ung dich chuyen pitch. Mot tang high-shelf nho (+2 dB tren 8 kHz) them ro rang. Tranh reverb lon trong boi canh giao tiep, no lam cho ban nghe giong nhu ban o trong mot hang dong.

AI voice cloning

Neu phan mem cua ban ho tro cac mo hinh giong noi AI, phuong phap dieu chinh khac: thay vi dieu chinh pitch va formant ban tay, ban lua chon mot mo hinh giong noi duoc huan luyen va dieu chinh cuong do chuyen doi (bao nhieu phan mem dua phat am cua ban huong toi giong noi dich). Bat dau o cuong do 70-80%, qua cao gay hieu ung tren phat am nhanh; qua thap cho phep giong noi ban dau tham nhap.


9. Tan doan van de Pho Bien

“Ung dung khong thay microphone ao” Khoi dong lai dich vu thay doi giong noi, sau do mo lai ung dung dich. Mot so ung dung cache danh sach thiet bi khi khoi dong va se khong phat hien thiet bi moi them sau.

“Giong noi nghe robotics hoac kim loai” Pitch da dich chuyen nhung formant khong phai. Kich hoat bao toan formant hoac dieu chinh slider dich chuyen formant de xap xi khop huong dich chuyen pitch.

“Ghe hop hoac giong noi kep trong Discord” Microphone vat ly hoat dong cung voi thiet bi ao. Tat hoac cam tieng microphone vat ly trong Windows Sound > Recording.

“Sap xep nhiet do Zoom dang giet hieu ung” Dat sap xep am thanh Zoom thanh Thap hoac Tat (Settings > Audio > Advanced).

“Phan mem thay doi giong noi gay ra tro choi crash hoac cam anti-cheat” Phan mem su dung mot driver che do nhan. Chuyen sang mot phan mem thay doi giong noi khong gian nguoi dung voi mot thiet bi ao WDM da ky.

“Latency cao, tam dung ro rang khi noi” Tang ukuran buffer WASAPI trong cai dat phan mem thay doi giong noi (buffer nho hon = latency thap hon nhung rui ro CPU cao hon). Hoac, dong cac ung dung am thanh canh tranh su dung cung thiet bi WASAPI.


Ket Luan

Thay doi giong noi cua ban qua microphone tren Windows huong toi bon dieu: hieu duong cac dac tinh am thanh ma ban dang tao kieu dung (pitch, formant, resonance), dinh tuyen tin hieu qua mot ung dung thay doi giong noi qua WASAPI, dau ra no toi mot microphone ao, va lua chon microphone ao do o moi ung dung dich. Setup tung ung dung tro nen gan nhu y hoc sau khi ban hieu duong khong kho so thiet lap.

Phan kho nhat thuong lam cho chuyen doi nghe tu nhien, va dieu do yeu cau dich chuyen formant cung voi dich chuyen pitch, khong chi mot offset tan so don gian.

Cho tat ca o mot noi, xu ly WASAPI, AI cloning, dinh tuyen ao, khong co driver kernel, tuong thich voi Windows 10 va 11, VoxBooster dang gia thu trong phien hoc tiep theo.

Dùng thử VoxBooster — 3 ngày dùng thử miễn phí.

Nhân bản giọng thời gian thực, soundboard và hiệu ứng — ở mọi nơi bạn đã nói chuyện.

  • Không cần thẻ tín dụng
  • ~30ms độ trễ
  • Discord · Teams · OBS
Dùng thử miễn phí 3 ngày