第16章測試與調試¶

軟件測試是確保代碼質量的重要環節，就像我們在聲紋識別項目中需要驗證模型的準確性一樣，Python代碼也需要通過測試來保證其功能正確性和穩定性。本章將詳細介紹Python中的測試框架、調試技術和性能優化方法，幫助你寫出更可靠的代碼。

16.1 軟件測試基礎¶

測試的重要性¶

軟件測試就像工廠的質量檢驗一樣，是保證產品質量的關鍵環節。在實際項目開發中，測試不僅能發現bugs，更重要的是能夠：

提高代碼質量：通過測試發現邏輯錯誤和邊界條件問題
增強重構信心：有了完善的測試，重構代碼時不用擔心破壞現有功能
文檔化作用：測試用例本身就是代碼行爲的最好文檔
降低維護成本：早期發現問題比後期修復成本更低

測試類型¶

軟件測試按照測試範圍可以分爲幾個層次，形成所謂的”測試金字塔”：

單元測試（Unit Test）：測試最小的代碼單元，通常是單個函數或方法
集成測試（Integration Test）：測試多個模塊之間的協作
系統測試（System Test）：測試整個系統的功能
驗收測試（Acceptance Test）：驗證系統是否滿足業務需求

測試驅動開發（TDD）¶

TDD是一種開發方法論，遵循”紅-綠-重構”的循環：
1. 紅：先寫測試，運行測試（失敗，因爲功能還沒實現）
2. 綠：編寫最少的代碼讓測試通過
3. 重構：優化代碼結構，保持測試通過

16.2 unittest框架¶

unittest是Python標準庫中的測試框架，提供了完整的測試功能。讓我們通過一個計算器的例子來學習unittest的使用。

unittest基礎¶

首先創建一個簡單的計算器類：

# test_calculator.py
class Calculator:
    def add(self, a, b):
        """加法運算"""
        return a + b

    def subtract(self, a, b):
        """減法運算"""
        return a - b

    def multiply(self, a, b):
        """乘法運算"""
        return a * b

    def divide(self, a, b):
        """除法運算"""
        if b == 0:
            raise ValueError("Cannot divide by zero")
        return a / b

接下來編寫unittest測試：

import unittest
from test_calculator import Calculator

class TestCalculator(unittest.TestCase):
    """Calculator類的unittest測試"""

    def setUp(self):
        """每個測試方法執行前調用"""
        self.calculator = Calculator()
        print(f"設置測試環境: {self._testMethodName}")

    def tearDown(self):
        """每個測試方法執行後調用"""
        print(f"清理測試環境: {self._testMethodName}")

    def test_add(self):
        """測試加法功能"""
        result = self.calculator.add(3, 5)
        self.assertEqual(result, 8)

        # 測試負數
        result = self.calculator.add(-1, 1)
        self.assertEqual(result, 0)

    def test_divide_by_zero(self):
        """測試除零異常"""
        with self.assertRaises(ValueError) as context:
            self.calculator.divide(10, 0)

        self.assertEqual(str(context.exception), "Cannot divide by zero")

if __name__ == '__main__':
    unittest.main()

運行unittest測試¶

運行上面的測試，你會看到如下輸出：

python test_unittest_example.py

test_add (__main__.TestCalculator.test_add)
測試加法功能 ... 設置測試環境: test_add
清理測試環境: test_add
ok
test_divide (__main__.TestCalculator.test_divide)
測試除法功能 ... 設置測試環境: test_divide
清理測試環境: test_divide
ok
test_divide_by_zero (__main__.TestCalculator.test_divide_by_zero)
測試除零異常 ... 設置測試環境: test_divide_by_zero
清理測試環境: test_divide_by_zero
ok
test_factorial (__main__.TestCalculator.test_factorial)
測試階乘功能 ... 設置測試環境: test_factorial
清理測試環境: test_factorial
ok
test_factorial_negative (__main__.TestCalculator.test_factorial_negative)
測試負數階乘異常 ... 設置測試環境: test_factorial_negative
清理測試環境: test_factorial_negative
ok
test_multiply (__main__.TestCalculator.test_multiply)
測試乘法功能 ... 設置測試環境: test_multiply
清理測試環境: test_multiply
ok
test_power (__main__.TestCalculator.test_power)
測試冪運算 ... 設置測試環境: test_power
清理測試環境: test_power
ok
test_subtract (__main__.TestCalculator.test_subtract)
測試減法功能 ... 設置測試環境: test_subtract
清理測試環境: test_subtract
ok

----------------------------------------------------------------------
Ran 8 tests in 0.003s

OK

斷言方法詳解¶

unittest提供了豐富的斷言方法：

# 基本斷言
self.assertEqual(a, b)        # a == b
self.assertNotEqual(a, b)     # a != b
self.assertTrue(x)            # bool(x) is True
self.assertFalse(x)           # bool(x) is False
self.assertIs(a, b)           # a is b
self.assertIsNone(x)          # x is None

# 異常斷言
self.assertRaises(ValueError, func, *args)
with self.assertRaises(ValueError):
    func(*args)

# 近似值斷言
self.assertAlmostEqual(a, b, places=7)  # 約等於
self.assertGreater(a, b)                # a > b
self.assertLess(a, b)                   # a < b

# 容器斷言
self.assertIn(a, b)                     # a in b
self.assertCountEqual(a, b)             # 列表元素相同（忽略順序）

測試組織¶

unittest支持多種測試組織方式：

class TestCalculatorAdvanced(unittest.TestCase):
    """Calculator類的高級測試"""

    @classmethod
    def setUpClass(cls):
        """整個測試類開始前調用一次"""
        print("開始Calculator高級測試")
        cls.calculator = Calculator()

    @classmethod
    def tearDownClass(cls):
        """整個測試類結束後調用一次"""
        print("完成Calculator高級測試")

# 創建測試套件
if __name__ == '__main__':
    suite = unittest.TestSuite()
    suite.addTest(unittest.TestLoader().loadTestsFromTestCase(TestCalculator))
    suite.addTest(unittest.TestLoader().loadTestsFromTestCase(TestCalculatorAdvanced))

    runner = unittest.TextTestRunner(verbosity=2)
    runner.run(suite)

16.3 pytest框架¶

pytest是Python生態中最流行的第三方測試框架，它提供了更簡潔的語法和更強大的功能。相比unittest，pytest更易於使用，有着豐富的插件生態系統。

安裝pytest¶

首先安裝pytest框架：

pip install pytest

Looking in indexes: https://mirrors.aliyun.com/pypi/simple
Requirement already satisfied: pytest in c:\users\15696\anaconda3\lib\site-packages (7.4.4)
Requirement already satisfied: iniconfig in c:\users\15696\anaconda3\lib\site-packages (from pytest) (1.1.1)
Requirement already satisfied: packaging in c:\users\15696\anaconda3\lib\site-packages (from pytest) (24.2)
Requirement already satisfied: pluggy<2.0,>=0.12 in c:\users\15696\anaconda3\lib\site-packages (from pytest) (1.0.0)
Requirement already satisfied: colorama in c:\users\15696\anaconda3\lib\site-packages (from pytest) (0.4.6)

pytest基礎使用¶

pytest的測試發現規則非常簡單：
- 測試文件以test_開頭或_test.py結尾
- 測試函數以test_開頭
- 測試類以Test開頭

讓我們用pytest重寫計算器測試：

import pytest
from test_calculator import Calculator

# Fixture定義
@pytest.fixture
def calculator():
    """創建Calculator實例的fixture"""
    print("\n創建Calculator實例")
    calc = Calculator()
    yield calc
    print("\n清理Calculator實例")

# 基礎測試
def test_add(calculator):
    """測試加法功能"""
    assert calculator.add(3, 5) == 8
    assert calculator.add(-1, 1) == 0
    assert calculator.add(0, 0) == 0

def test_divide_by_zero(calculator):
    """測試除零異常"""
    with pytest.raises(ValueError, match="Cannot divide by zero"):
        calculator.divide(10, 0)

運行pytest測試¶

使用pytest運行測試非常簡單：

pytest test_pytest_example.py -v

輸出結果：

============================= test session starts =============================
platform win32 -- Python 3.12.7, pytest-7.4.4, pluggy-1.0.0 -- C:\Users\15696\anaconda3\python.exe
cachedir: .pytest_cache
rootdir: D:\PyCharm\AutoBlog\Python從入門到精通\test_files
plugins: anyio-4.2.0, typeguard-4.4.2
collected 26 items

test_pytest_example.py::test_add PASSED                                  [  3%]
test_pytest_example.py::test_subtract PASSED                             [  7%]
test_pytest_example.py::test_multiply PASSED                             [ 11%]
test_pytest_example.py::test_divide PASSED                               [ 15%]
test_pytest_example.py::test_divide_by_zero PASSED                       [ 19%]
test_pytest_example.py::test_power PASSED                                [ 23%]
test_pytest_example.py::test_factorial PASSED                            [ 26%]
test_pytest_example.py::test_factorial_negative PASSED                   [ 30%]
test_pytest_example.py::test_add_parametrized[1-2-3] PASSED              [ 34%]
test_pytest_example.py::test_add_parametrized[0-0-0] PASSED              [ 38%]
test_pytest_example.py::test_add_parametrized[-1-1-0] PASSED             [ 42%]
test_pytest_example.py::test_add_parametrized[100-200-300] PASSED        [ 46%]
test_pytest_example.py::test_add_parametrized[0.1-0.2-0.3] PASSED        [ 50%]
test_pytest_example.py::test_factorial_parametrized[0-1] PASSED          [ 53%]
test_pytest_example.py::test_factorial_parametrized[1-1] PASSED          [ 57%]
test_pytest_example.py::test_factorial_parametrized[2-2] PASSED          [ 61%]
test_pytest_example.py::test_factorial_parametrized[3-6] PASSED          [ 65%]
test_pytest_example.py::test_factorial_parametrized[4-24] PASSED         [ 69%]
test_pytest_example.py::test_factorial_parametrized[5-120] PASSED        [ 73%]
test_pytest_example.py::test_large_factorial PASSED                      [ 76%]
test_pytest_example.py::test_basic_operations PASSED                     [ 80%]
test_pytest_example.py::test_edge_cases PASSED                           [ 84%]
test_pytest_example.py::TestCalculatorClass::test_addition_in_class PASSED [ 88%]
test_pytest_example.py::TestCalculatorClass::test_multiplication_in_class PASSED [ 92%]
test_pytest_example.py::test_intentional_failure XFAIL (這是一個預期...) [ 96%]
test_pytest_example.py::test_skipped_test SKIPPED (暫時跳過這個測試)     [100%]

============ 24 passed, 1 skipped, 1 xfailed, 3 warnings in 0.15s =============

pytest的Fixtures¶

Fixtures是pytest的核心特性，提供了靈活的測試數據和資源管理：

@pytest.fixture(scope="module")
def module_calculator():
    """模塊級別的Calculator fixture"""
    print("\n創建模塊級Calculator實例")
    calc = Calculator()
    yield calc
    print("\n清理模塊級Calculator實例")

@pytest.fixture
def calculator():
    """函數級別的Calculator fixture（默認作用域）"""
    print("\n創建Calculator實例")
    calc = Calculator()
    yield calc
    print("\n清理Calculator實例")

Fixture的作用域包括：
- function：每個測試函數都會創建新的fixture（默認）
- class：每個測試類創建一次
- module：每個模塊創建一次
- session：整個測試會話創建一次

參數化測試¶

pytest的參數化測試功能非常強大，可以用同一個測試函數測試多組數據：

@pytest.mark.parametrize("a,b,expected", [
    (1, 2, 3),
    (0, 0, 0),
    (-1, 1, 0),
    (100, 200, 300),
    (0.1, 0.2, 0.3),
])
def test_add_parametrized(calculator, a, b, expected):
    """參數化測試加法"""
    result = calculator.add(a, b)
    if isinstance(expected, float):
        assert result == pytest.approx(expected)
    else:
        assert result == expected

@pytest.mark.parametrize("n,expected", [
    (0, 1), (1, 1), (2, 2), (3, 6), (4, 24), (5, 120),
])
def test_factorial_parametrized(calculator, n, expected):
    """參數化測試階乘"""
    assert calculator.factorial(n) == expected

測試標記¶

pytest支持給測試打標記，方便分組運行：

@pytest.mark.slow
def test_large_factorial(calculator):
    """測試大數階乘（標記爲慢測試）"""
    result = calculator.factorial(10)
    assert result == 3628800

@pytest.mark.basic
def test_basic_operations(calculator):
    """基礎運算測試"""
    assert calculator.add(1, 1) == 2
    assert calculator.subtract(1, 1) == 0

# 只運行標記爲basic的測試
# pytest -m basic test_pytest_example.py

跳過和預期失敗¶

pytest提供了靈活的測試跳過和預期失敗機制：

@pytest.mark.skip(reason="暫時跳過這個測試")
def test_skipped_test(calculator):
    """被跳過的測試"""
    assert True

@pytest.mark.xfail(reason="這是一個預期失敗的測試，用於演示")
def test_intentional_failure(calculator):
    """故意失敗的測試"""
    assert calculator.add(2, 2) == 5  # 故意寫錯

pytest配置¶

可以在項目根目錄創建pytest.ini配置文件：

[tool:pytest]
testpaths = tests
python_files = test_*.py
python_classes = Test*
python_functions = test_*
addopts = -v --tb=short
markers =
    slow: marks tests as slow
    basic: marks tests as basic functionality
    edge_case: marks tests as edge cases

16.4 Mock和測試替身¶

在實際項目中，我們的代碼往往依賴外部服務、數據庫、文件系統等資源。在測試時，我們不希望真正調用這些外部依賴，這時就需要使用Mock技術。

Mock概念¶

Mock是測試替身的一種，用於模擬真實對象的行爲。測試替身主要包括：

Dummy：僅用於填充參數，不會被真正使用
Fake：有簡化的實現，如內存數據庫
Stub：返回預設的響應
Mock：可以驗證交互行爲的對象
Spy：記錄如何被調用的真實對象

unittest.mock模塊¶

Python標準庫的unittest.mock提供了強大的Mock功能：

import unittest
from unittest.mock import Mock, MagicMock, patch, mock_open

class UserService:
    def __init__(self, api_client):
        self.api_client = api_client

    def get_user(self, user_id):
        """獲取用戶信息"""
        response = self.api_client.get(f"/users/{user_id}")
        if response.status_code == 200:
            return response.json()
        else:
            raise ValueError(f"User {user_id} not found")

class TestMockExample(unittest.TestCase):
    def test_mock_basic(self):
        """Mock基礎用法"""
        # 創建Mock對象
        mock_client = Mock()

        # 配置Mock對象的返回值
        mock_response = Mock()
        mock_response.status_code = 200
        mock_response.json.return_value = {"id": 1, "name": "張三"}

        mock_client.get.return_value = mock_response

        # 測試
        service = UserService(mock_client)
        user = service.get_user(1)

        # 驗證結果
        self.assertEqual(user["name"], "張三")
        # 驗證Mock調用
        mock_client.get.assert_called_once_with("/users/1")

運行Mock測試¶

運行上面的Mock測試：

python test_mock_example.py -v

輸出結果：

test_file_operations (__main__.TestMockExample.test_file_operations)
測試文件操作的Mock ... ok
test_mock_basic (__main__.TestMockExample.test_mock_basic)
Mock基礎用法 ... ok
test_mock_side_effect (__main__.TestMockExample.test_mock_side_effect)
測試Mock的副作用 ... ok
test_notification_service (__main__.TestMockExample.test_notification_service)
測試通知服務 ... ok
test_patch_decorator (__main__.TestMockExample.test_patch_decorator)
使用patch裝飾器模擬time.sleep ... ok
test_save_data (__main__.TestMockExample.test_save_data)
測試保存數據 ... ok

----------------------------------------------------------------------
Ran 6 tests in 0.018s

OK

高級Mock技術¶

1. 副作用（Side Effects）¶

Mock可以模擬不同的返回值或異常：

def test_mock_side_effect(self):
    """測試Mock的副作用"""
    mock_client = Mock()

    # 設置副作用：第一次調用成功，第二次失敗
    responses = [
        Mock(status_code=200, json=lambda: {"id": 1, "name": "張三"}),
        Mock(status_code=404)
    ]
    mock_client.get.side_effect = responses

    service = UserService(mock_client)

    # 第一次調用成功
    user = service.get_user(1)
    self.assertEqual(user["name"], "張三")

    # 第二次調用失敗
    with self.assertRaises(ValueError):
        service.get_user(2)

2. patch裝飾器¶

patch裝飾器可以臨時替換模塊中的對象：

@patch('builtins.open', new_callable=mock_open, read_data='{"name": "test", "version": "1.0"}')
def test_file_operations(self, mock_file):
    """測試文件操作的Mock"""
    processor = FileProcessor()
    config = processor.read_config("config.json")

    self.assertEqual(config["name"], "test")
    # 驗證文件打開調用
    mock_file.assert_called_once_with("config.json", 'r', encoding='utf-8')

@patch('test_mock_example.time.sleep')
def test_patch_decorator(self, mock_sleep):
    """使用patch裝飾器模擬time.sleep"""
    email_service = EmailService()

    start_time = time.time()
    result = email_service.send_email("test@example.com", "Test", "Hello")
    end_time = time.time()

    # 驗證沒有真正sleep，執行很快
    self.assertLess(end_time - start_time, 0.1)
    mock_sleep.assert_called_once_with(1)

3. spec參數¶

使用spec參數可以讓Mock對象具有真實對象的接口：

def test_notification_service(self):
    """測試通知服務"""
    # 創建EmailService的Mock，具有真實對象的接口
    mock_email_service = Mock(spec=EmailService)
    mock_email_service.send_email.return_value = {"status": "sent", "message_id": "msg_123"}

    notification_service = NotificationService(mock_email_service)
    message_id = notification_service.notify_user("user@example.com", "Hello")

    self.assertEqual(message_id, "msg_123")
    mock_email_service.send_email.assert_called_once_with(
        to="user@example.com",
        subject="通知",
        body="Hello"
    )

Mock最佳實踐¶

最小化Mock範圍：只Mock必要的依賴，避免過度Mock
使用spec參數：確保Mock對象接口正確
驗證交互：不僅要驗證返回值，還要驗證方法調用
清晰的Mock設置：Mock配置應該清晰易懂
避免測試Mock本身：測試應該聚焦業務邏輯，而不是Mock行爲

16.5 測試覆蓋率¶

測試覆蓋率是衡量測試質量的重要指標，它告訴我們代碼的哪些部分被測試執行了。

安裝coverage.py¶

coverage.py是Python最流行的覆蓋率工具：

pip install coverage

基礎使用¶

使用coverage運行測試：

# 運行測試並收集覆蓋率數據
coverage run -m unittest test_calculator.py

# 生成覆蓋率報告
coverage report

輸出示例：

Name                    Stmts   Miss  Cover
-------------------------------------------
test_calculator.py         25      0   100%
test_unittest_example.py   45      2    96%
-------------------------------------------
TOTAL                      70      2    97%

HTML覆蓋率報告¶

生成詳細的HTML報告：

coverage html

這會在htmlcov/目錄下生成HTML報告，可以在瀏覽器中打開查看詳細的覆蓋率信息。

配置覆蓋率¶

創建.coveragerc配置文件：

[run]
source = .
omit = 
    */venv/*
    */tests/*
    setup.py

[report]
exclude_lines =
    pragma: no cover
    def __repr__
    raise AssertionError
    raise NotImplementedError

[html]
directory = htmlcov

與pytest集成¶

pytest可以很好地與coverage集成：

# 安裝pytest-cov插件
pip install pytest-cov

# 運行測試並生成覆蓋率報告
pytest --cov=. --cov-report=html --cov-report=term

16.6 調試技術¶

調試是開發過程中不可避免的環節。Python提供了多種調試技術和工具。

print調試¶

最簡單的調試方法是使用print語句：

def calculate_average(numbers):
    print(f"輸入的數字列表: {numbers}")

    if not numbers:
        print("警告: 空列表")
        return 0

    total = sum(numbers)
    print(f"總和: {total}")

    average = total / len(numbers)
    print(f"平均值: {average}")

    return average

# 測試
result = calculate_average([1, 2, 3, 4, 5])
print(f"最終結果: {result}")

logging調試¶

使用logging模塊進行調試更加專業：

import logging

# 配置日誌
logging.basicConfig(
    level=logging.DEBUG,
    format='%(asctime)s - %(name)s - %(levelname)s - %(message)s'
)

logger = logging.getLogger(__name__)

def process_data(data):
    logger.info(f"開始處理數據，數據長度: {len(data)}")

    try:
        # 數據處理邏輯
        result = []
        for item in data:
            logger.debug(f"處理項目: {item}")
            processed_item = item * 2
            result.append(processed_item)

        logger.info(f"數據處理完成，結果長度: {len(result)}")
        return result

    except Exception as e:
        logger.error(f"數據處理出錯: {e}")
        raise

pdb調試器¶

Python內置的pdb調試器提供了強大的調試功能：

import pdb

def buggy_function(x, y):
    pdb.set_trace()  # 設置斷點
    result = x / y
    return result * 2

# 調試時會在斷點處停下來，可以檢查變量值
result = buggy_function(10, 0)

常用pdb命令：
- n (next)：執行下一行
- s (step)：進入函數內部
- c (continue)：繼續執行
- l (list)：顯示當前代碼
- p variable_name：打印變量值
- h (help)：顯示幫助

IDE調試¶

現代IDE都提供了圖形化調試界面，如PyCharm、VS Code等，它們提供：
- 可視化斷點設置
- 變量監視窗口
- 調用棧查看
- 條件斷點
- 遠程調試支持

16.7 性能分析和優化¶

性能優化是軟件開發的重要環節，Python提供了多種性能分析工具。

時間分析¶

使用time模塊¶

import time

def slow_function():
    time.sleep(1)
    return sum(range(1000000))

# 簡單的時間測量
start_time = time.time()
result = slow_function()
end_time = time.time()

print(f"函數執行時間: {end_time - start_time:.4f} 秒")

使用timeit模塊¶

timeit提供了更精確的時間測量：

import timeit

# 測試代碼片段的執行時間
time_taken = timeit.timeit(
    'sum(range(100))',
    number=10000
)
print(f"執行10000次的總時間: {time_taken:.4f} 秒")

# 比較不同實現的性能
setup_code = "numbers = list(range(1000))"

list_comp_time = timeit.timeit(
    '[x*2 for x in numbers]',
    setup=setup_code,
    number=1000
)

map_time = timeit.timeit(
    'list(map(lambda x: x*2, numbers))',
    setup=setup_code,
    number=1000
)

print(f"列表推導時間: {list_comp_time:.4f} 秒")
print(f"map函數時間: {map_time:.4f} 秒")

使用cProfile¶

cProfile是Python內置的性能分析器：

import cProfile
import pstats

def fibonacci(n):
    if n < 2:
        return n
    return fibonacci(n-1) + fibonacci(n-2)

def main():
    for i in range(30):
        fibonacci(i)

# 性能分析
if __name__ == '__main__':
    profiler = cProfile.Profile()
    profiler.enable()

    main()

    profiler.disable()

    # 生成報告
    stats = pstats.Stats(profiler)
    stats.sort_stats('cumulative')
    stats.print_stats(10)  # 顯示前10個最耗時的函數

內存分析¶

使用tracemalloc¶

Python 3.4+內置的內存追蹤模塊：

import tracemalloc

def memory_heavy_function():
    # 創建大量對象
    data = []
    for i in range(100000):
        data.append([i] * 100)
    return data

# 開始內存追蹤
tracemalloc.start()

# 執行函數
data = memory_heavy_function()

# 獲取內存使用情況
current, peak = tracemalloc.get_traced_memory()
print(f"當前內存使用: {current / 1024 / 1024:.1f} MB")
print(f"峯值內存使用: {peak / 1024 / 1024:.1f} MB")

tracemalloc.stop()

代碼優化策略¶

算法優化：選擇更高效的算法
數據結構優化：使用合適的數據結構
緩存：避免重複計算
向量化：使用NumPy等庫進行向量化計算
併發：利用多線程或多進程

16.8 集成測試和端到端測試¶

單元測試驗證單個組件的功能，而集成測試和端到端測試確保整個系統協同工作。

集成測試¶

集成測試驗證不同模塊之間的交互。在聲紋識別項目中，這意味着測試模型訓練、特徵提取和識別流程的集成。

API集成測試¶

使用requests庫測試API接口：

import requests
import unittest

class APIIntegrationTest(unittest.TestCase):
    def setUp(self):
        self.base_url = "http://localhost:8000"
        self.session = requests.Session()

    def test_user_workflow(self):
        """測試完整的用戶工作流"""
        # 1. 創建用戶
        user_data = {
            "name": "張三",
            "email": "zhangsan@example.com"
        }
        response = self.session.post(f"{self.base_url}/users", json=user_data)
        self.assertEqual(response.status_code, 201)
        user_id = response.json()["id"]

        # 2. 獲取用戶信息
        response = self.session.get(f"{self.base_url}/users/{user_id}")
        self.assertEqual(response.status_code, 200)
        user = response.json()
        self.assertEqual(user["name"], "張三")

        # 3. 更新用戶信息
        update_data = {"email": "zhangsan_new@example.com"}
        response = self.session.patch(f"{self.base_url}/users/{user_id}", json=update_data)
        self.assertEqual(response.status_code, 200)

        # 4. 刪除用戶
        response = self.session.delete(f"{self.base_url}/users/{user_id}")
        self.assertEqual(response.status_code, 204)

數據庫集成測試¶

測試數據庫操作的完整流程：

import sqlite3
import unittest
import tempfile
import os

class DatabaseIntegrationTest(unittest.TestCase):
    def setUp(self):
        # 創建臨時數據庫
        self.db_fd, self.db_path = tempfile.mkstemp()
        self.conn = sqlite3.connect(self.db_path)

        # 創建表
        self.conn.execute('''
            CREATE TABLE users (
                id INTEGER PRIMARY KEY,
                name TEXT NOT NULL,
                email TEXT UNIQUE NOT NULL
            )
        ''')
        self.conn.commit()

    def tearDown(self):
        self.conn.close()
        os.close(self.db_fd)
        os.unlink(self.db_path)

    def test_user_crud_operations(self):
        """測試用戶的CRUD操作"""
        # Create
        cursor = self.conn.cursor()
        cursor.execute("INSERT INTO users (name, email) VALUES (?, ?)", 
                      ("張三", "zhangsan@example.com"))
        user_id = cursor.lastrowid
        self.conn.commit()

        # Read
        cursor.execute("SELECT * FROM users WHERE id = ?", (user_id,))
        user = cursor.fetchone()
        self.assertIsNotNone(user)
        self.assertEqual(user[1], "張三")

        # Update
        cursor.execute("UPDATE users SET email = ? WHERE id = ?", 
                      ("zhangsan_new@example.com", user_id))
        self.conn.commit()

        cursor.execute("SELECT email FROM users WHERE id = ?", (user_id,))
        email = cursor.fetchone()[0]
        self.assertEqual(email, "zhangsan_new@example.com")

        # Delete
        cursor.execute("DELETE FROM users WHERE id = ?", (user_id,))
        self.conn.commit()

        cursor.execute("SELECT * FROM users WHERE id = ?", (user_id,))
        user = cursor.fetchone()
        self.assertIsNone(user)

端到端測試¶

端到端測試從用戶角度測試整個應用流程。

Web應用測試¶

使用Selenium進行Web應用的端到端測試：

from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
import unittest

class WebE2ETest(unittest.TestCase):
    def setUp(self):
        # 使用Chrome瀏覽器（需要安裝ChromeDriver）
        self.driver = webdriver.Chrome()
        self.driver.implicitly_wait(10)

    def tearDown(self):
        self.driver.quit()

    def test_login_workflow(self):
        """測試登錄工作流"""
        # 1. 打開登錄頁面
        self.driver.get("http://localhost:8000/login")

        # 2. 輸入用戶名和密碼
        username_input = self.driver.find_element(By.NAME, "username")
        password_input = self.driver.find_element(By.NAME, "password")

        username_input.send_keys("testuser")
        password_input.send_keys("testpass")

        # 3. 點擊登錄按鈕
        login_button = self.driver.find_element(By.XPATH, "//button[@type='submit']")
        login_button.click()

        # 4. 驗證登錄成功
        WebDriverWait(self.driver, 10).until(
            EC.presence_of_element_located((By.CLASS_NAME, "dashboard"))
        )

        # 5. 驗證用戶信息顯示
        user_info = self.driver.find_element(By.CLASS_NAME, "user-info")
        self.assertIn("testuser", user_info.text)

測試環境管理¶

Docker化測試環境¶

使用Docker創建一致的測試環境：

# Dockerfile.test
FROM python:3.9

WORKDIR /app

COPY requirements.txt .
RUN pip install -r requirements.txt

COPY . .

CMD ["python", "-m", "pytest", "tests/", "-v"]

# docker-compose.test.yml
version: '3.8'
services:
  test-app:
    build:
      context: .
      dockerfile: Dockerfile.test
    depends_on:
      - test-db
    environment:
      - DATABASE_URL=postgresql://test:test@test-db:5432/testdb
    volumes:
      - .:/app

  test-db:
    image: postgres:13
    environment:
      - POSTGRES_DB=testdb
      - POSTGRES_USER=test
      - POSTGRES_PASSWORD=test
    ports:
      - "5432:5432"

運行測試：

# 啓動測試環境
docker-compose -f docker-compose.test.yml up --build

# 運行特定測試
docker-compose -f docker-compose.test.yml run test-app python -m pytest tests/integration/ -v

測試數據管理¶

使用固定的測試數據集：

import json
import os

class TestDataManager:
    def __init__(self, data_dir="test_data"):
        self.data_dir = data_dir
        os.makedirs(data_dir, exist_ok=True)

    def load_test_data(self, filename):
        """加載測試數據"""
        file_path = os.path.join(self.data_dir, filename)
        with open(file_path, 'r', encoding='utf-8') as f:
            return json.load(f)

    def save_test_data(self, filename, data):
        """保存測試數據"""
        file_path = os.path.join(self.data_dir, filename)
        with open(file_path, 'w', encoding='utf-8') as f:
            json.dump(data, f, ensure_ascii=False, indent=2)

# 使用示例
test_data_manager = TestDataManager()

# 創建測試用戶數據
users_data = [
    {"id": 1, "name": "張三", "email": "zhangsan@example.com"},
    {"id": 2, "name": "李四", "email": "lisi@example.com"},
    {"id": 3, "name": "王五", "email": "wangwu@example.com"}
]
test_data_manager.save_test_data("users.json", users_data)

16.9 測試最佳實踐¶

經過多年的項目實踐，總結出以下測試最佳實踐，這些經驗在聲紋識別項目和其他機器學習項目中都得到了驗證。

測試設計原則¶

1. FIRST原則¶

好的測試應該遵循FIRST原則：

Fast（快速）：測試應該快速執行，單元測試通常在毫秒級
Independent（獨立）：測試之間不應該相互依賴
Repeatable（可重複）：在任何環境下都能得到相同結果
Self-Validating（自我驗證）：測試結果應該是明確的布爾值
Timely（及時）：測試應該及時編寫，最好採用TDD

2. 測試金字塔¶

合理分配不同層次的測試：

       /\
      /  \     E2E Tests (10%)
     /____\
    /      \   Integration Tests (20%)
   /________\
  /          \ Unit Tests (70%)
 /____________\

70%單元測試：快速、穩定、易維護
20%集成測試：驗證模塊間協作
10%端到端測試：驗證用戶場景

3. 測試用例設計¶

使用等價類劃分和邊界值分析：

def test_age_validation():
    """測試年齡驗證的邊界條件"""
    validator = AgeValidator()

    # 等價類：有效年齡
    assert validator.is_valid(25) == True
    assert validator.is_valid(65) == True

    # 邊界值：最小有效值
    assert validator.is_valid(18) == True
    assert validator.is_valid(17) == False

    # 邊界值：最大有效值
    assert validator.is_valid(100) == True
    assert validator.is_valid(101) == False

    # 特殊值
    assert validator.is_valid(0) == False
    assert validator.is_valid(-1) == False

測試代碼質量¶

1. 可讀性優先¶

測試代碼應該像文檔一樣易讀：

def test_voice_recognition_accuracy_with_noise():
    """在有噪聲環境下驗證聲紋識別準確率"""
    # Given: 準備有噪聲的音頻樣本
    clean_audio = load_audio("test_samples/user1_clean.wav")
    noisy_audio = add_noise(clean_audio, noise_level=0.1)

    # When: 進行聲紋識別
    recognition_result = voice_recognizer.identify(noisy_audio)

    # Then: 驗證識別準確率在可接受範圍內
    assert recognition_result.confidence > 0.8
    assert recognition_result.user_id == "user1"

2. 避免測試代碼重複¶

使用測試工具類和夾具：

class VoiceTestUtils:
    @staticmethod
    def create_test_user(name="張三", voice_samples=None):
        """創建測試用戶"""
        voice_samples = voice_samples or ["sample1.wav", "sample2.wav"]
        return User(name=name, voice_samples=voice_samples)

    @staticmethod
    def assert_recognition_result(result, expected_user_id, min_confidence=0.8):
        """驗證識別結果"""
        assert result.user_id == expected_user_id
        assert result.confidence >= min_confidence

# 在測試中使用
def test_user_registration():
    user = VoiceTestUtils.create_test_user("李四")
    result = voice_system.register_user(user)
    VoiceTestUtils.assert_recognition_result(result, "李四")

持續測試¶

1. CI/CD集成¶

在GitHub Actions中配置自動化測試：

# .github/workflows/test.yml
name: Tests

on: [push, pull_request]

jobs:
  test:
    runs-on: ubuntu-latest
    strategy:
      matrix:
        python-version: [3.8, 3.9, 3.10]

    steps:
    - uses: actions/checkout@v2

    - name: Set up Python ${{ matrix.python-version }}
      uses: actions/setup-python@v2
      with:
        python-version: ${{ matrix.python-version }}

    - name: Install dependencies
      run: |
        python -m pip install --upgrade pip
        pip install -r requirements.txt
        pip install pytest pytest-cov

    - name: Run tests
      run: |
        pytest --cov=. --cov-report=xml

    - name: Upload coverage
      uses: codecov/codecov-action@v1
      with:
        file: ./coverage.xml

2. 質量門禁¶

設置測試覆蓋率和質量標準：

# pytest.ini
[tool:pytest]
minversion = 6.0
addopts = 
    --strict-markers
    --cov=src
    --cov-fail-under=80
    --cov-report=term-missing
    --cov-report=html
    --cov-report=xml
testpaths = tests

團隊協作¶

1. 測試約定¶

制定團隊測試規範：

# 測試文件命名規範
tests/
├── unit/
│   ├── test_user_service.py
│   ├── test_voice_processor.py
│   └── test_model_trainer.py
├── integration/
│   ├── test_api_endpoints.py
│   └── test_database_operations.py
└── e2e/
    └── test_voice_recognition_workflow.py

# 測試方法命名規範
def test_should_return_true_when_valid_input():
    """測試方法應該描述預期行爲"""
    pass

def test_should_raise_exception_when_invalid_user():
    """異常測試應該明確說明觸發條件"""
    pass

2. 代碼審查檢查清單¶

測試相關的代碼審查要點：

[ ] 測試覆蓋了主要的業務邏輯路徑
[ ] 包含了邊界條件和異常情況的測試
[ ] 測試代碼清晰易懂，有適當的註釋
[ ] Mock使用合理，沒有過度模擬
[ ] 測試數據合理，不包含敏感信息
[ ] 集成測試驗證了關鍵的業務流程

特殊場景測試¶

1. 機器學習模型測試¶

對於聲紋識別這樣的機器學習項目，還需要特殊的測試策略：

def test_model_performance_regression():
    """測試模型性能不出現迴歸"""
    # 使用固定的測試數據集
    test_dataset = load_test_dataset("voice_test_set.pkl")

    # 加載基準模型結果
    baseline_accuracy = load_baseline_metrics()["accuracy"]

    # 測試當前模型
    current_accuracy = evaluate_model(current_model, test_dataset)

    # 確保性能不低於基準
    assert current_accuracy >= baseline_accuracy * 0.95  # 允許5%的波動

def test_model_bias_fairness():
    """測試模型在不同羣體上的公平性"""
    male_samples = load_voice_samples(gender="male")
    female_samples = load_voice_samples(gender="female")

    male_accuracy = evaluate_model(model, male_samples)
    female_accuracy = evaluate_model(model, female_samples)

    # 確保性別間的準確率差異不超過5%
    accuracy_diff = abs(male_accuracy - female_accuracy)
    assert accuracy_diff <= 0.05

2. 性能測試¶

確保系統在負載下的表現：

import time
import concurrent.futures

def test_concurrent_voice_recognition():
    """測試併發聲紋識別性能"""
    audio_samples = [f"test_audio_{i}.wav" for i in range(100)]

    start_time = time.time()

    with concurrent.futures.ThreadPoolExecutor(max_workers=10) as executor:
        futures = [executor.submit(voice_recognizer.identify, audio) 
                  for audio in audio_samples]
        results = [future.result() for future in futures]

    end_time = time.time()

    # 驗證處理時間
    total_time = end_time - start_time
    avg_time_per_sample = total_time / len(audio_samples)

    assert avg_time_per_sample < 0.5  # 平均每個樣本處理時間不超過0.5秒
    assert all(result.confidence > 0.7 for result in results)  # 所有結果置信度合格

通過遵循這些最佳實踐，我們可以構建一個robust、可維護的測試體系，就像在聲紋識別項目中通過完善的測試確保了模型的可靠性一樣。測試不僅僅是驗證代碼正確性的工具，更是保障軟件質量和團隊協作效率的重要手段。

第16章 測試與調試¶