13:59:58 #startmeeting 13:59:58 Meeting started Thu Sep 29 13:59:58 2016 UTC. The chair is karsten. Information about MeetBot at http://wiki.debian.org/MeetBot. 13:59:58 Useful Commands: #action #agreed #help #info #idea #link #topic. 14:00:02 hello! 14:00:06 hi :-) 14:00:10 hey :) 14:00:12 hi 14:00:17 hi mnpkt 14:00:33 https://pad.riseup.net/p/3M7VyrTVgjlF <- agenda pad 14:02:40 alright, looks like we already have a few agenda items there. should we start? 14:02:46 sure. 14:02:52 sure 14:02:55 * CollecTor misc., tasks to be completed Sep 30 (karsten) 14:03:19 so, I think we worked on a few collector-related tickets in the past week or so. 14:03:34 Isn't #19317 completed and can be merged? 14:03:47 and, would be another finished task. 14:03:48 we should try to finish some of them by the end of september, so that we can include them in the monthly report. 14:04:11 it's completed and merged, there were just a few suggestions to be added later on. 14:04:18 like 0b1111_1111. 14:04:29 though I had problems with cobertura not understanding those literals. 14:04:41 ok 14:04:46 but those suggestions were referenced from another ticket, so it's probably safe to close that ticket. 14:05:06 yes 14:05:35 okay, I'll close it after the meeting, after looking once more. 14:05:49 I also worked a bit on #19755 yesterday. 14:06:02 to make sure: 14:06:10 branch task-19755-2: latest 14 commits for review? 14:06:18 heh, let me count them.. 14:06:48 yep. 14. 14:06:56 the first is big, the others are much smaller. 14:07:03 ok. 14:07:09 and I think you saw a very similar version of that first commit earlier. 14:07:21 basically, I cleaned that up to make it easier to review. 14:07:28 and fixed more things. 14:07:33 fine. 14:07:43 doable in Sep. 14:07:51 nice! 14:08:06 then we have #20228 and #20234. 14:08:18 right. 14:08:48 protocol 0.9? 14:08:57 quick idea: can we include a compact version of that text file in index.html? 14:09:08 I'd rather not. 14:09:11 It's 14:09:28 the 99% are not really interested part. 14:10:02 Or, have it all in html. 14:10:06 ? 14:10:06 well, I wonder if we can find a representation that makes it more interesting to the average user. 14:10:33 I mean, we do have an informal version of that protocol on index.html. 14:10:34 like: 14:10:57 "The server descriptors in the descriptor archives contain one descriptor per file, whereas the recently published files contain all descriptors collected in an hour concatenated into a single file. " 14:11:27 maybe we can find a representation for paths that is easily comprehensible and that contains many more details. 14:11:31 That's the part explicitly excluded from the file structure protocol. 14:11:41 ok. 14:12:06 That would need more text I think. 14:12:10 oh, yes. 14:12:19 longer descriptions. 14:12:32 The more technical protocol could be 14:12:42 linked from html first. 14:12:57 I mean , link the git repo document. 14:13:05 If there are request 14:13:17 or questions it could be added later? 14:13:18 maybe we can find a notation with placeholders for "valid-after month", "published second", dash, compression type, etc. 14:13:37 and put something in that notation on the html page. 14:13:48 I don't yet know how exactly that would work. I haven't tried. 14:14:28 as a first step proto 0.9 could be finished? 14:14:57 and as second step we try to simply that and include it in the html? 14:15:45 looks like we lost iwakeh.. 14:16:08 mnpkt: want to discuss your questions? 14:16:13 * short questions about collector file parsing & path selection algorithms (if theres time) (mnpkt) 14:16:16 sure 14:16:18 thanks 14:16:54 so i got a small Rscript to parse part of the consensus files from collector 14:17:12 this helped a lot: http://jordan-wright.com/images/blog/how_tor_works/consensus.png 14:17:26 is there something similar for the other files? 14:17:39 well, there are libraries for the parsing. 14:17:43 not for R though. 14:18:07 but I'd think that other languages are better suited for parsing lots of descriptors anyway. 14:18:07 just documents describing which variable means what? 14:18:16 ah, no, APIs. 14:18:19 that is probably true 14:18:21 :) 14:18:24 do you know java or python? 14:18:27 or maybe go? 14:18:35 a bit python but feel much more comfortable in R 14:19:00 well, you'd only have to use python to turn descriptors into something that R can process more easily. 14:19:06 and I think there are tutorials. 14:19:25 https://stem.torproject.org/tutorials/mirror_mirror_on_the_wall.html 14:19:41 thanks I will give that a try 14:20:04 okay. and your second question was about path selection? 14:20:11 yes 14:20:42 so I looked at the specs and am unclear if it is possible clients select a non guard flagged relay as entry 14:21:03 wb iwakeh 14:21:05 back again 14:21:12 since there are weigths for that in the consensus 14:21:19 hi iwakeh 14:21:21 hmmmm 14:21:29 Wgm - Weight for non-flagged nodes in the guard Position 14:22:20 and additionally if the self described bandwith of the relays is actually used as additional weigth in Guard selection 14:22:32 found that here: 14:22:39 path sprec 5.1 Guard selection algorithm 4. 14:22:45 * CRN_NEED_CAPACITY: potentially suitable routers are weighted by 14:22:45 their advertised bandwidth capacity. 14:22:50 the bandwidth that's contained in the consensus is used. plus the Wxx weight. 14:22:52 https://gitweb.torproject.org/torspec.git/tree/path-spec.txt 14:23:16 I'm surprised that that weight is not 0. 14:23:24 me 2 14:23:27 I can't give you a clear answer now. 14:23:33 ok 14:23:41 you could look at the tor code. 14:23:47 and see what clients do. 14:24:02 or you could look at TorPS, which is a path simulator. 14:24:13 ahh that sound promising 14:24:20 or ask again in this channel when core tor devs are around. 14:24:29 though they're likely busy this week with the meeting. 14:24:43 ok. 14:24:56 hope that helps. shall we go back to the earlier topic? 14:24:59 ok thanks very much 14:25:01 sure 14:25:05 yes. 14:25:15 14:15:00 < karsten> and as second step we try to simply that and include it in the html? 14:25:18 14:15:48 < karsten> looks like we lost iwakeh.. 14:25:25 simplify* 14:25:48 ok, so we can finish proto 0.9 in sept? 14:26:20 my main concern is that we're creating another document that we need to maintain. 14:26:31 see comment:9 of #20234 14:26:38 if we remain open to transforming it to something else, like the html, I'm all for it. 14:26:45 oh, sure. 14:27:20 yes, that comment makes sense to me. 14:27:30 fine :-) 14:27:37 I'll do another review then. 14:28:03 after I add another version with the mentioned changes. 14:28:07 in sept? 14:28:12 oh. 14:28:20 hopefully. :) 14:29:26 we're in agenda point two, now. 14:29:33 then we could try to resolve the question of file-by-published-time vs. file-by-downloaded-time. 14:29:50 #20228 14:29:57 now? 14:30:05 before end of sep. 14:30:21 and maybe send out the tor-dev@ notice tomorrow. 14:30:32 well, ... we 14:30:45 could make that two questions: 14:31:09 1. make the change to append votes into one doc 14:31:50 and 2. how to group the descs into files. 14:31:53 ? 14:32:08 sure, those are separate questions. 14:32:19 the initial vote grouping could be by download time just b/c of the reason that 14:32:48 this is, currently, done with the other descs too. 14:33:47 I still favor grouping by published-time. 14:34:06 I think we have different use cases in mind. 14:34:23 yes, just needs some more pondering over this. 14:34:34 here's an issue with grouping by published time: 14:34:37 (just came to mind) 14:34:45 :-) 14:34:47 imagine you receive a descriptor that was published 70 hours ago. 14:34:56 would you take that out of recent/ 2 hours later? 14:35:08 why not? 14:35:26 it will be in the tar. 14:35:46 yes, but the recent/ dir is for clients that want to stay updated without having to look at the tars. 14:35:51 it's a different channel. 14:36:16 yes, but won't clients be confused by aged descs? 14:36:20 and the idea is that they can fail for 2+ days and still get all the descriptors they need when they come back. 14:36:43 they shouldn't be. 14:36:56 nothing guarantees that descriptors come in order. 14:37:06 but they don't expect them either. 14:37:17 clients? they shouldn't. 14:37:23 not the clients I wrote. 14:37:25 ;) 14:37:33 ok :-) 14:37:34 we *could* make that more explicit. 14:37:49 anyway, happy to discuss more on trac. 14:37:51 and move on to 2. 14:38:00 sure, part is 14:38:16 already answered with the tasks we addressed already. 14:38:19 ok. 14:38:22 #20236 up for review or asking for info/input? 14:38:36 oh! 14:38:42 (Didn't notice #20227 for that reason, i.e. that is was still 'new') 14:38:57 argh, ok. 14:39:13 there was no patch to review.. 14:39:18 just an idea to discuss. 14:39:28 true, could have been needs-info 14:39:35 it's also not urgent, it just came to mind as something that could be done quickly. 14:39:38 right. 14:39:45 I replied already. 14:40:11 okay, sounds good. 14:40:21 so this goes in sep? 14:40:26 quick answer is that we probably don't want to prioritize finding somebody else to work on this. 14:40:35 removing? yes. 14:40:51 but is it important enough to 14:40:52 part of the goal there was to prototype a new graphing engine. 14:41:06 and this was the sample that the volunteer was working on. 14:41:07 keep a note about this somewhere with lower priority? 14:41:25 ah, ok, now I get the point. 14:41:31 then just remove. 14:41:43 yep. and remember locally. :) 14:41:50 I have more ideas here... ;) 14:41:51 on paper. 14:42:03 heh, sort of. 14:42:31 okay, I don't think that patch needs review. 14:42:35 removing code. 14:42:38 fine. 14:42:46 just as a way to speed up things. 14:42:56 realizing that end of september is near. 14:43:02 yep. 14:43:13 okay, what other questions? 14:43:26 topic two is done. 14:43:30 ok. 14:43:35 * Tor Metrics usability analysis (karsten) 14:43:44 I didn't hear back from UX folks. 14:43:44 was it reviewed? 14:44:06 but I wonder if we should give that more time and not publish early. 14:44:26 depends on the way of publishing? 14:44:37 right. 14:45:15 well, now that I think about it, I'd say let's wait another week or two. 14:45:17 what makes you reluctant publishing? 14:45:32 yes, we have time to wait. 14:45:34 that attention will be lower when publishing a revision. 14:45:45 and maybe the revision will be much better with input from UX folks. 14:45:48 that's true. 14:46:02 okay, never mind the agenda item then. 14:46:05 we'll wait. 14:46:26 alright. we already discussed item 4. 14:46:32 what else? :) 14:46:42 hhmm, action items? 14:46:44 :-) 14:46:54 yeah, I collected them during the meeting. 14:46:59 anything I missed? 14:47:11 not that i'm aware of. 14:47:25 okay, cool! 14:47:44 so, talk more next week? 14:47:44 back to work :-) 14:47:46 hehe 14:47:48 sure. 14:48:05 great! thanks! :) 14:48:08 #endmeeting