文章详情

短信预约-IT技能 免费直播动态提醒

请输入下面的图形验证码

提交验证

短信预约提醒成功

time.Now() 的意外行为在 goroutine 中

2024-04-04 23:25

关注

怎么入门Golang编程?需要学习哪些知识点?这是新手们刚接触编程时常见的问题;下面编程网就来给大家整理分享一些知识点,希望能够给初学者一些帮助。本篇文章就来介绍《time.Now() 的意外行为在 goroutine 中》,涉及到,有需要的可以收藏一下

问题内容

作为尝试熟悉 go 的一种方式,我正在尝试构建一个(完全不可靠的)随机数生成器。这个想法是对某个 url 进行 100 个 get 请求,对结果进行一些处理并生成一个“随机”数字。

我有兴趣看看在工作组中使用 goroutine 来执行请求时代码是否会运行得更快。答案似乎是肯定的,但是当打印出各个请求的计时结果时,goroutine 调用的计时显示出一个有趣的结果。

get 请求的顺序时间(以微秒为单位):

[25007 30502 25594 40417 31505 18502 20503 19034 19473 18001 36507 25004 28005 19004 20502 20503 20503 20504 20002 190 03 20511 18494 20003 21004 20003 20502 20504 19002 19004 21506 29501 30005 31005 21504 20054 22452 19503 19503 20003 19503 21 004 18501 18003 20003 20003 19003 19503 20003 23504 18003 20003 19503 19502 19003 20003 20003 20040 21010 18959 20503 34251 27260 30504 25004 22004 20502 20003 19503 205 02 20504 19503 22003 19003 19003 20003 20002 18003 19503 19003 18503 20504 18552 18953 18002 20003 19004 21002 18503 20503 19 503 20504 20003 20003 21003 46050 19504 18503 19503 19503 19002]

goroutine get 请求的计时(以微秒为单位):

[104518 134570 157528 187533 193535 193535 208036 211041 220039 220242 252044 252044 258045 258045 258045 258045 27104 7 282050 282050 282050 286050 287050 289051 296052 297552 300052 300678 305553 307053 308054 310556 311069 312055 312555 3240 56 329558 334559 339559 346061 353562 360563 369564 375065 377566 384067 393569 397069 402570 410072 416572 420573 425574 431076 437576 443078 446577 453579 458580 465081 474583 480584 488085 496122 505588 510589 5155 90 520591 526592 533593 538596 544595 549596 555097 563098 569600 575100 584101 589604 595604 604106 610606 620609 634111 640 611 645613 653119 656616 663116 669117 674118 681119 696122 709123 723627 735629 747631 757632 769635 779137 785139]

goroutine 调用的计时是增量的,而常规的顺序计时是预期的。我怀疑这可能与 time.now() 对所有 gorotine 进行一次评估有关,但随机调用该调用并没有改变结果。

这就是我到目前为止所拥有的,我知道熵并不是随机性的一个很好的衡量标准,但出于某种原因我还是把它包括了:)

首先运行 goroutine,然后运行顺序版本。最后,打印时间和其他一些内容。

package main

import (
    "fmt"
    "log"
    "math/rand"
    "net/http"
    "sync"
    "time"

    "github.com/montanaflynn/stats"
)

func doGet(address string, channel chan int, wg *sync.WaitGroup) {
    // do get request in a workgroup
    // defer wg signal
    defer wg.Done()
    // start time
    startTime := time.Now()
    _, err := http.Get(address)
    if err != nil {
        log.Fatalln(err)
    }
    // get time since start
    delta := int(time.Since(startTime).Microseconds())
    channel <- delta
}

func doGetNoWg(address string) int {
    // do get request without a workgroup/channel
    start := time.Now()
    _, err := http.Get(address)
    if err != nil {
        log.Fatalln(err)
    }
    return int(time.Since(start).Microseconds())
}

func main() {
    var wg sync.WaitGroup
    // initialize arrays for the timings
    var timings_parallel [100]int
    var timings_sequential [100]int
    // get a small uniform set for comparison of entropy
    zeroes := []int{1, 1, 1}
    // get a random set for comparison of entropy
    var randnrs [100]int
    for i := 0; i < len(randnrs); i++ {
        randnrs[i] = rand.Intn(250)
    }
    // start
    start := time.Now()
    ch := make(chan int, 100)
    url := "https://www.nu.nl"
    wg.Add(100)
    for i, _ := range timings_parallel {
        // can this be done without dummy assignemnt or explicit counters?
        i = i
        go doGet(url, ch, &wg)
    }
    wg.Wait()
    close(ch)
    // feed the results from the channel into the result array
    count := 0
    for ret := range ch {
        timings_parallel[count] = ret
        count++
    }
    // get total running time for this part
    time_parallel := time.Since(start).Milliseconds()

    // start of the sequential part
    start = time.Now()
    for i, _ := range timings_sequential {
        timings_sequential[i] = doGetNoWg(url)
    }
    // end sequential part. Why was I using goroutines again? :P
    time_sequential := time.Since(start).Milliseconds()
    
    // calculate entropy
    entropy, _ := stats.Entropy(stats.LoadRawData(timings_parallel[:]))
    entropy_equal, _ := stats.Entropy(stats.LoadRawData(zeroes[:]))
    entropy_random, _ := stats.Entropy(stats.LoadRawData(randnrs[:]))

    // print out stuffs
    fmt.Print("Parallel: ")
    fmt.Printf("%v\n", timings_parallel)
    fmt.Print("Sequential: ")
    fmt.Printf("%v\n", timings_sequential)
    fmt.Printf("Entropy equal: %v\n", entropy_equal)
    fmt.Printf("Entropy random: %v\n", entropy_random)
    fmt.Printf("Entropy: %v\n", entropy)
    fmt.Printf("Time elapsed parallel: %v\n", time_parallel)
    fmt.Printf("Time elapsed sequential: %v", time_sequential)

}

示例输出(不含时序数组):

熵等于:1.0986122886681096 随机熵:4.39737296171013 熵:4.527705829831552 并行时间经过:786 连续经过的时间:2160

因此,goroutines 部分似乎要快得多,而各个时间似乎要快得多。有人知道如何正确安排时间吗(或者为什么他们会如此预期)?

=====更新

goroutines 的最后一个计时几乎总是等于或比 中测量的总时间低一毫秒time elapsed parallel

=====更新2 问题似乎是第一次调用 time.now() 总是产生相同的时间,而第二次调用 time.now() 工作正常。至少这解释了结果:

开始时间: 2022-04-05 18:47:06.3117452 +0200 cest m=+0.004000601 gostop: 2022-04-05 18:47:06.4736105 +0200 cest m=+0.165865901 开始时间: 2022-04-05 18:47:06.3117452 +0200 cest m=+0.004000601 gostop: 2022-04-05 18:47:06.4736105 +0200 cest m=+0.165865901 ... 开始时间: 2022-04-05 18:47:06.3117452 +0200 cest m=+0.004000601 gostop: 2022-04-05 18:47:06.6234215 +0200 cest m=+0.315676901


正确答案


导致此行为的原因在于 go 的调度程序(shorter version of this question at golang-nuts)。上面的 goroutine 都在同一时间点开始执行(正如计时所示,加上对 starttime 变量的内存位置的检查证明时间对象没有“回收”),但是一旦它们命中 http.get() 就会被取消调度。计时是增量的,因为 http.get() 创建了一个瓶颈,不允许并发执行生成的 goroutine 数量。这里似乎使用了某种 fifo 队列。
推荐观看和阅读:
Explaining the Golang I/O multiplexing netpoller model
Queues, Fairness and the Go scheduler

在调整等待组大小时,我发现一些值显示出更一致的计时(而不是增量)。所以我想知道等待组大小对总时间和个人时间的影响是什么。我将上面的内容重构为一个程序,该程序在给定范围内对每个 waitgroupsize 进行多次实验,并将每次运行的总计时和单独计时保存到 sqlite 数据库中。生成的数据集可以轻松地用于例如jupyter 笔记本。不幸的是,在当前的设置下,我只能达到大约 40k 的请求,然后就受到限制。如果您感兴趣但又不想等待数据,请参阅 my github 以获取某些数据集,因为需要很长时间才能完成。有趣的结果是,小工作组大小的并发/顺序比率急剧下降,最后您会看到连接开始受到限制。这次运行当时被手动中止。
并发运行时间/顺序运行时间 vs 等待组大小:

不同等待组大小的各个时间的一些图。

package main

import (
    "database/sql"
    "fmt"
    "log"
    "net/http"
    "os"
    "path/filepath"
    "runtime"
    "sync"
    "time"

    _ "github.com/mattn/go-sqlite3"
)

///// global vars
const REQUESTS int = 100           // Single run size, performed two times (concurrent and sequential)
const URL string = "SET_YOUR_OWN" // Some file on a CDN somewhere; used for the GET requests
const DBNAME string = "netRand.db" // Name of the db file. Saved next to the executable
const WGMIN int = 1                // Start range for waitgroup size (inclusive)
const WGMAX int = 101              // Stop range for waitgroup size (exclusive)
const NREPEAT int = 10             // Number of times to repeat a run for a specific waitgroup size

//// types
type timingResult struct {
    // Container for collecting results before persisting to DB
    WaitgroupSize       int
    ConcurrentTimingsMs [REQUESTS]int64
    ConcurrentTotalMs   int64
    SequentialTimingsMs [REQUESTS]int64
    SequentialTotalMs   int64
}

//// main
func main() {
    db := setupDb()
    defer db.Close()
    for i := WGMIN; i < WGMAX; i++ {
        // waitgroup size range
        for j := 0; j < NREPEAT; j++ {
            // repeat for more data points
            timings := requestTimes(i)
            persistTimings(timings, db)
            fmt.Printf("\n======== %v of %v ============\n", j+1, NREPEAT)
            fmt.Printf("current waitgroup size: %v\n", i)
            fmt.Printf("max waitgroup size: %v\n", WGMAX-1)
        }
    }

}

func requestTimes(waitgroupSize int) timingResult {
    // do NTIMES requests in go routines with waitgroupSize
    // do NTIMES requests sequentially

    timings_concurrent, total_concurrent := concurrentRequests(waitgroupSize)
    timings_sequential, total_sequential := sequentialRequests()

    return timingResult{
        WaitgroupSize:       waitgroupSize,
        ConcurrentTimingsMs: timings_concurrent,
        ConcurrentTotalMs:   total_concurrent,
        SequentialTimingsMs: timings_sequential,
        SequentialTotalMs:   total_sequential,
    }

}
func persistTimings(timings timingResult, db *sql.DB) {
    persistRun(timings, db)
    currentRunId := getCurrentRunId(db)
    persistConcurrentTimings(currentRunId, timings, db)
    persistSequentialTimings(currentRunId, timings, db)
}
func concurrentRequests(waitgroupSize int) ([REQUESTS]int64, int64) {
    start := time.Now()

    var wg sync.WaitGroup
    var timings [REQUESTS]int64
    ch := make(chan int64, REQUESTS)

    for i := range timings {
        wg.Add(1)
        go func() {
            defer wg.Done()
            doGetChannel(URL, ch)
        }()
        // waitgroupsize is controlled using modulo
        // making sure experiment size is always NTIMES
        // independent of waitgroupsize
        if i%waitgroupSize == 0 {
            wg.Wait()
        }
    }
    wg.Wait()
    close(ch)

    count := 0
    for ret := range ch {
        timings[count] = ret
        count++
    }

    return timings, time.Since(start).Milliseconds()
}
func doGetChannel(address string, channel chan int64) {
    // time get request and send to channel
    startSub := time.Now().UnixMilli()
    _, err := http.Get(address)
    if err != nil {
        log.Fatalln(err)
    }
    stopSub := time.Now().UnixMilli()
    delta := stopSub - startSub
    channel <- delta
}
func sequentialRequests() ([REQUESTS]int64, int64) {
    startGo := time.Now()
    var timings_sequential [REQUESTS]int64
    for i := range timings_sequential {
        timings_sequential[i] = doGetReturn(URL)
    }
    return timings_sequential, time.Since(startGo).Milliseconds()
}
func doGetReturn(address string) int64 {
    // time get request without a waitgroup/channel
    start := time.Now()
    _, err := http.Get(address)
    if err != nil {
        log.Fatalln(err)
    }
    duration := time.Since(start).Milliseconds()
    return duration
}

//// DB
func setupDb() *sql.DB {
    //      __________________________runs____________________
    //     |                                                  |
    // concurrent_timings(fk: run_id)         sequential_timings(fk: run_id)
    //
    const createRuns string = `
    CREATE TABLE IF NOT EXISTS runs (
    run_id INTEGER NOT NULL PRIMARY KEY,
    time TIMESTAMP NOT NULL DEFAULT CURRENT_TIMESTAMP,
    waitgroup_size INTEGER,
    concurrent_total_ms INTEGER,
    sequential_total_ms INTEGER,
    concurrent_sequential_ratio REAL
    );`

    const createSequentialTimings string = `
    CREATE TABLE IF NOT EXISTS sequential_timings (
    run INTEGER,
    call_number INTEGER,
    timing_ms INTEGER,
    FOREIGN KEY(run) REFERENCES runs(run_id)
    );`

    const createConcurrentTimings string = `
    CREATE TABLE IF NOT EXISTS concurrent_timings (
    run INTEGER,
    channel_position INTEGER,
    timing_ms INTEGER,
    FOREIGN KEY(run) REFERENCES runs(run_id)
    );`
    // retrieve platform appropriate connection string
    dbString := getConnectionString(DBNAME)
    db, err := sql.Open("sqlite3", dbString)
    if err != nil {
        log.Fatalln(err)
    }
    if _, err := db.Exec(createRuns); err != nil {
        log.Fatalln(err)
    }
    if _, err := db.Exec(createSequentialTimings); err != nil {
        log.Fatalln(err)
    }
    if _, err := db.Exec(createConcurrentTimings); err != nil {
        log.Fatalln(err)
    }
    return db
}
func getConnectionString(dbName string) string {
    // Generate platform appropriate connection string
    // the db is placed in the same directory as the current executable

    // retrieve the path to the currently executed executable
    ex, err := os.Executable()
    if err != nil {
        panic(err)
    }
    // retrieve path to containing dir
    dbDir := filepath.Dir(ex)

    // Append platform appropriate separator and dbName
    if runtime.GOOS == "windows" {

        dbDir = dbDir + "\\" + dbName

    } else {
        dbDir = dbDir + "/" + dbName
    }
    return dbDir
}
func persistRun(timings timingResult, db *sql.DB) {
    tx, err := db.Begin()
    if err != nil {
        log.Fatalln(err)
    }

    insertRun, err := db.Prepare(`INSERT INTO runs(
        waitgroup_size, 
        sequential_total_ms, 
        concurrent_total_ms, 
        concurrent_sequential_ratio) 
        VALUES(?, ?, ?, ?)`)

    if err != nil {
        log.Fatalln(err)
    }
    defer tx.Stmt(insertRun).Close()
    _, err = tx.Stmt(insertRun).Exec(
        timings.WaitgroupSize,
        timings.SequentialTotalMs,
        timings.ConcurrentTotalMs,
        float32(timings.ConcurrentTotalMs)/float32(timings.SequentialTotalMs),
    )
    if err != nil {
        log.Fatalln(err)
    }
    err = tx.Commit()

    if err != nil {
        log.Fatalln(err)
    }
}

func getCurrentRunId(db *sql.DB) int {
    rows, err := db.Query("SELECT MAX(run_id) FROM runs")
    if err != nil {
        log.Fatal(err)
    }
    var run_id int
    for rows.Next() {
        err = rows.Scan(&run_id)
        if err != nil {
            log.Fatalln(err)
        }
    }
    rows.Close()
    return run_id
}
func persistConcurrentTimings(runId int, timings timingResult, db *sql.DB) {
    tx, err := db.Begin()
    if err != nil {
        log.Fatalln(err)
    }

    insertTiming, err := db.Prepare(`INSERT INTO concurrent_timings(
        run, 
        channel_position, 
        timing_ms) 
        VALUES(?, ?, ?)`)

    if err != nil {
        log.Fatalln(err)
    }
    for i, timing := range timings.ConcurrentTimingsMs {
        _, err = tx.Stmt(insertTiming).Exec(
            runId,
            i,
            timing,
        )
        if err != nil {
            log.Fatalln(err)
        }
    }

    err = tx.Commit()

    if err != nil {
        log.Fatalln(err)
    }
}
func persistSequentialTimings(runId int, timings timingResult, db *sql.DB) {
    tx, err := db.Begin()
    if err != nil {
        log.Fatalln(err)
    }

    insertTiming, err := db.Prepare(`INSERT INTO sequential_timings(
        run, 
        call_number, 
        timing_ms) 
        VALUES(?, ?, ?)`)

    if err != nil {
        log.Fatalln(err)
    }
    for i, timing := range timings.SequentialTimingsMs {
        _, err = tx.Stmt(insertTiming).Exec(
            runId,
            i,
            timing,
        )
        if err != nil {
            log.Fatalln(err)
        }
    }

    err = tx.Commit()

    if err != nil {
        log.Fatalln(err)
    }
}

理论要掌握,实操不能落!以上关于《time.Now() 的意外行为在 goroutine 中》的详细介绍,大家都掌握了吧!如果想要继续提升自己的能力,那么就来关注编程网公众号吧!

阅读原文内容投诉

免责声明:

① 本站未注明“稿件来源”的信息均来自网络整理。其文字、图片和音视频稿件的所属权归原作者所有。本站收集整理出于非商业性的教育和科研之目的,并不意味着本站赞同其观点或证实其内容的真实性。仅作为临时的测试数据,供内部测试之用。本站并未授权任何人以任何方式主动获取本站任何信息。

② 本站未注明“稿件来源”的临时测试数据将在测试完成后最终做删除处理。有问题或投稿请发送至: 邮箱/279061341@qq.com QQ/279061341

软考中级精品资料免费领

  • 历年真题答案解析
  • 备考技巧名师总结
  • 高频考点精准押题
  • 2024年上半年信息系统项目管理师第二批次真题及答案解析(完整版)

    难度     813人已做
    查看
  • 【考后总结】2024年5月26日信息系统项目管理师第2批次考情分析

    难度     354人已做
    查看
  • 【考后总结】2024年5月25日信息系统项目管理师第1批次考情分析

    难度     318人已做
    查看
  • 2024年上半年软考高项第一、二批次真题考点汇总(完整版)

    难度     435人已做
    查看
  • 2024年上半年系统架构设计师考试综合知识真题

    难度     224人已做
    查看

相关文章

发现更多好内容

猜你喜欢

AI推送时光机
位置:首页-资讯-后端开发
咦!没有更多了?去看看其它编程学习网 内容吧
首页课程
资料下载
问答资讯