14:01:57 #startmeeting metrics team
14:01:57 Meeting started Thu Nov 19 14:01:57 2015 UTC. The chair is karsten. Information about MeetBot at http://wiki.debian.org/MeetBot.
14:01:57 Useful Commands: #action #agreed #help #info #idea #link #topic.
14:02:06 hello. who's here for the metrics team meeting?
14:02:11 me!
14:02:17 me too!
14:02:31 me and hi
14:02:40 hi, yorick, tomlurge_, Letty!
14:02:49 https://pad.riseup.net/p/zUNzEIFRq5S4 <- agenda
14:02:57 please add agenda items that you want to discuss today.
14:03:26 I exist
14:03:31 hi virgil
14:03:57 as do I, hello
14:04:20 karsten: me
14:04:23 hi flatline! (we'll need quick introductions from you and yorick to learn what you're up to. we'll get to that in a minute.)
14:04:27 hi qbi!
14:05:12 actually, yorick and flatline, want to say a few words what you're interested in regarding tor metrics?
14:05:32 (everyone, feel free to add agenda items in the meantime.)
14:05:58 yorick, flatline: if you're mostly here to listen, that's fine, too.
14:06:34 I am very happy to say a few words about why I'm here
14:06:39 great!
14:07:12 hi everyone! I came here mostly to ask a question re university project work, I think you guys might have a topic for me
14:07:31 I have a fun little hidden service that seemed like it would be neat to show around
14:07:40 (possibly not, but it would be cool to be able to help out here and also have a topic for my project)
14:08:28 yorick, flatline: great! added to the agenda.
14:08:54 okay, let's start.
14:08:57 * Vegas team lead meeting at 17:30 UTC; anything to bring up there?
14:09:14 so, one remark in berlin was that there's little communication between teams.
14:09:28 and that communication may appear unidirectionally.
14:09:35 let's fix that. what should I bring up?
14:09:43 I'll report back on metrics-team@ later.
14:11:36 anything you're happy or unhappy about and that you want other teams to know?
14:11:38 Do the other teams have protocols?
14:11:47 or some kind of abstract of their last meeting?
14:12:02 like StatusBot logs?
14:12:07 err, MeetBot
14:12:29 or summarized by humans?
14:12:38 MeetBot is fine.
14:12:38 qbi: Error: "is" is not a valid command.
14:12:42 (the answer is 'no clue' regardless, I'm trying to figure out what you'd expect.)
14:13:02 should I suggest using MeetBot to them?
14:13:14 Well, at least for me just reading a log is fine.
14:13:23 I wouldn't require anyone to come here and report.
14:13:24 well, ask if they're already doing that, and if not, suggest it.
14:13:38 okay, but you'd read the log.
14:13:52 I guess that's a fine question I should ask.
14:13:53 Even an agenda would be good. So one could skim over and ask the other team about single issues.
14:14:04 ok.
14:14:12 anything else?
14:14:23 (I'll write that down on the pad in a minute. thanks!)
14:14:40 Information should be pulled, IMHO.
14:14:57 hmm?
14:15:13 Only if we have questions about some issues we should inite a person to come to "our" meetings and report.
14:15:36 But if we just want to know happens in another team I'd suggest to actively look for that information.
14:15:43 ok.
14:15:50 instead of put the burden on other persons.
14:16:12 right, makes sense.
14:16:27 turning on MeetBot is not that much of an inconvenience.
14:16:29 for most. ;)
14:16:39 writing a summary might be.
14:16:45 okay, shall we move on?
14:17:10 * Analytics server (tomlurge) [max 10m]
14:17:15 tomlurge_: want to give an update?
14:17:24 yep:
14:17:49 status is that I’m almost done with the converters from collector to JSON
14:18:07 (if my computer hadn’t crashed last night….)
14:18:24 (good thing you had a recent enough backup!)
14:18:41 plan is to start ingesting collector data into the analytics server for real coming monday
14:18:53 which descriptor types?
14:18:57 relays, bridges, others?
14:19:06 all of them, practically
14:19:14 btw, do you have a link to your github?
14:19:20 so that others can follow if they want?
14:19:39 github.com/tomlurge/mteam
14:19:56 great. and sound like great progress!
14:19:59 sounds*
14:20:06 anything we can help with?
14:20:19 yeah, i was pretty impressed myself :-)
14:20:44 not right now. if you have a spare hour beginnig of next week, that could be useful
14:20:54 sure, that should work.
14:21:02 also I found some missing parts in the metrics-lib library
14:21:05 hehe
14:21:07 even more?
14:21:12 mind filing tickets?
14:21:17 yes, another one
14:21:22 that's the most convenient way to forg^Whandle them.
14:21:37 but more seriously, I'll look into those.
14:21:38 i prefer sending you a mail…
14:21:43 well, or that.
14:22:04 .. there is some more stuff, and a little bug. i’m still collecting...
14:22:06 I'm currently jumping from one code base to the other, depending which of them is most on fire.
14:22:14 that's good.
14:22:32 no fires here, just features, mostly
14:22:35 okay, should we talk more early next week and move on here?
14:22:36 ok.
14:22:41 yep!
14:22:48 * Disagreement of directory authorities (Letty) http://letty.io/tor/ [max 10m]
14:23:06 Since last meeting, i decide to change the visualization from nested circles into a aster plot (i think that is the name)
14:23:17 I hope that improves the the understanding/readability of areas that represent the relay count
14:23:46 are you using areas or heights to represent relay counts?
14:24:11 In the current version is the flag "Fast" used
14:24:25 i use the hight
14:25:01 ok. is that because you think it's the right thing to do or because you didn't change that yet?
14:25:09 * karsten isn't sure what's right either.
14:25:19 Letty: I like seeing it jiggle
14:25:22 i need to find a research paper about this viz, i'm not sure if hight or area is better
14:25:57 okay.
14:26:02 usually stacked bar chart is the suggested way to show parts of a whole
14:26:17 virgil: right, we discussed that as an alternative visualizationt, oo.
14:26:18 -t
14:26:26 Letty: tbh when I visited your site the meaning of the diagram was not clear to me. Could you add some explanantion?
14:26:47 qbi: I was planning to help with writing text once the technical pieces are in place.
14:26:56 qbi: but you could help with that if you want. :)
14:27:11 First, I need to understand it. :-)
14:27:24 i think we have in all viz the same problem. the data is not normal distributed
14:27:30 dark green means that a relay got the Fast flag in this case from all authorities,
14:27:37 yes
14:27:38 lighter green means n-1 authorities.
14:27:45 dark brown means 1.
14:27:58 left half of the circle -> get into consensus, right half -> not in consensus.
14:28:27 qbi: okay, want to help make the initial description that I'll help write better?
14:28:36 of course
14:28:48 I also want to ask harmony for help with that, because he's just awesome with text.
14:28:53 cool.
14:29:06 Letty: so, what can we help with?
14:29:17 besides discussing alternative visualizations. ;)
14:29:43 and if you need more time to discuss things, I'm happy to schedule another time for that, too.
14:30:07 i need an idea what kind of UI element i should use for changing the flags ;)
14:30:22 ah, ok.
14:30:41 (anyone? /me is still thinking)
14:31:01 words?
14:31:16 karsten: and i need something like a git. is it a good idea to have it on my normal github account (not private)?
14:31:19 asn: dgoulet - would be happy to talk about the shared random thing i just emailed about
14:31:30 Letty: yes, github is just fine.
14:31:33 ok
14:31:57 so, something like this: http://getbootstrap.com/components/#nav-pills
14:32:06 or tabs? is that what you're asking?
14:32:15 (tabs are above pills)
14:32:32 actually, I think tomlurge_ would be a good person to talk to.
14:32:43 ioerror: and yes, words would probably work better than icons.
14:33:32 Letty, tomlurge_: want to talk about this after the meeting, maybe?
14:33:49 karsten, letty: sure!
14:34:06 maybe words are the best thing, everthing else is like the 90's..
14:34:07 perfect! (assuming that Letty is also fine with that idea.)
14:34:21 tomlurge, karsten: yes!
14:34:24 great!
14:34:30 okay, let's move on.
14:34:43 Letty: ping me if you need anything else, like new data.
14:34:48 * Roster update (virgil, Sean) [max 5m -- if there's not much to update]
14:34:54 -- Just working on with the BGP graph
14:35:02 not much to say. Just working on the BGP graph.
14:35:03 aye
14:35:08 working on the diversity measurs
14:35:13 diversity measure*
14:35:18 and Sean has been busy with linear algebra
14:35:33 #undergradlife
14:35:49 ok. I saw threads with juris about things not working as they're supposed to, and then getting fixed.
14:36:00 yeah various bugfixes
14:36:09 most of that seemed to be due to Onionoo issues
14:36:13 oh...
14:36:18 ha ;P
14:36:21 yeah, it's rather unhappy these days.
14:36:28 it's cool. It'll get fixed.
14:36:37 it's on my queue
14:36:53 ah, then let me briefly explain the plan for fixing it:
14:36:56 (well, my plan)
14:37:21 sure
14:37:21 put all documents in the out/ directory into a psql database, replicate that to front-end hosts, run 2 or 3 of them to handle the load.
14:37:32 that shouldn't be terribly hard to implement.
14:37:44 because there's already something like a database abstraction layer in onionoo.
14:37:57 but I'm saying such things about other coding projects, too, and they're never true.
14:38:04 so, should be easy. :)
14:38:35 ok.
14:38:55 if you come up with other good ideas, happy to consider them, too.
14:39:21 should we move on?
14:39:30 move on
14:39:40 ok.
14:39:44 * possible "data mining" projects for university project (yorick) [max 10m]
14:39:54 yorick: what do you have in mind, roughly?
14:40:03 I am a second-year university-level computer science student, doing a "data mining" course.
14:40:04 yorick: and do you know the various things that already exist in the metrics space?
14:40:12 This course involves a project (about 60 hours of work split over two people, mostly involving applying algorithms to a dataset and hoping anything interesting shows up) and I am still looking for a good subject
14:40:20 some guy suggested I ask here
14:41:14 karsten: I'm not all too familiar with the tor-metrics space, but you seem to have some interesting data available
14:41:17 ioerror: see #tor-project please
14:41:20 https://lists.torproject.org/pipermail/metrics-team/2015-November/000002.html
14:41:26 https://people.torproject.org/~karsten/volatile/metrics-data-visualizations.pdf
14:41:37 oha. couldn’t that be a good use of the analytics server?
14:41:54 yorick: I wonder if page 2 in that pdf would be something you'd want to do.
14:41:57 yorick, do you have to use a specific programming language?
14:42:00 yorick: possibly with the analytics server.
14:42:37 though, tomlurge_, it's not ready yet.
14:42:55 and maybe this project should start with some raw data.
14:43:03 not with everything nicely put into a database.
14:43:06 that would be the next question: what’s the timeline
14:43:09 (/me is mostly guessing here.)
14:43:24 tomlurge_: no specific programming language. timeline is "before jan 20th"
14:43:33 well, if it’s about algorithms…
14:44:06 yorick: so, assuming this recommended versions thing is remotely interesting to you,
14:44:24 I could write you a short introduction to what data and tools you would need.
14:45:27 tomlurge_: I think the analytics server is too much for 60 hours for 2 people.
14:45:45 too much in what sense?
14:45:49 yorick: want to think about it and drop me an email?
14:46:06 karsten: sure! data viz isn't quite in the scope, but it can still work
14:46:06 tomlurge_: I think you'd need those 60 hours to get started, but then you don't have any useful results yet.
14:46:21 yorick: well, it doesn't have to do fancy visualization parts.
14:46:36 yorick: it could just provide data for a visualization like that, and use something simple to present them.
14:46:43 maybe even just a table.
14:47:06 karsten: just counting versions does not sound very challenging
14:47:30 I think it gets more interesting when you look how outdated relays are,
14:47:39 and what happens when new releases come out, etc.
14:47:49 I don't know what excitements you'll encounter while digging.
14:48:20 the reason why I'm suggesting it is that the data is not that hard to understand, so that there's little time you need to spend on getting started.
14:48:22 the problem with the analytics server is that it’s not finished yet. but when it’s finished it’s whole purpose is that you code algorithms and let them run on it. so it seems like a good fit to me
14:48:44 tomlurge_: okay, let's keep that in mind as an option.
14:48:57 but, we should move on.
14:48:58 karsten: okay, I'll send you that email
14:49:03 yorick: great!
14:49:06 :)
14:49:11 * fun little hidden service (flatline) [max 10m]
14:49:16 what's that about?
14:49:44 it's about allowing for contact with hidden services
14:50:01 while maintaining their anonymity and deniability
14:50:28 okay, explain more? (also, how is this related to tor metrics?)
14:50:59 implementation is a dead drop encrypted with the permanent key from the hidden service descriptor
14:51:37 hidden service operators*
14:52:05 and I'm not sure it's directly and immediately related to tor metrics
14:52:08 sounds more relevant to network team than metrics team
14:52:13 but still cool
14:52:16 yep.
14:52:27 it's something I've actuallly wanted with ONionLink
14:53:00 how about you bring this up again after the meeting, highlighting the usual suspect hidden service developers?
14:53:27 that sounds reasonable
14:53:32 ok!
14:53:39 we just ran out of agenda items.
14:53:46 anything else we should discuss in the remaining 5 minutes?
14:54:49 ok. no need to wait the full 5 minutes to find out.
14:54:54 thanks, everyone, for attending!
14:55:06 feel free to bring up questions and issues on metrics-team@.
14:55:11 next meeting in two weeks. cheers!
14:55:12 thanks for feedback
14:55:21 thanks for working on this, Letty!
14:55:24 #endmeeting