18:01:31 #startmeeting gitlab migration meeting 18:01:31 Meeting started Wed Apr 15 18:01:31 2020 UTC. The chair is ahf. Information about MeetBot at http://wiki.debian.org/MeetBot. 18:01:31 Useful Commands: #action #agreed #help #info #idea #link #topic. 18:01:40 sorry, i ended up in a discussion in another room 18:01:51 gaba is missing, she had to run somewhere 18:01:59 let me just find the pad and stuff 18:02:31 https://pad.riseup.net/p/gitlab-migration-problems is the pad 18:02:52 so friday was not my most active day, since i spend it with family in the distance since i incremented my birth counter 18:03:27 i did look a bit more at the wiki stuff and stole some parts from tracboat (again), but the list of tickets of course being the big problem with migration there 18:03:36 i think last week a lot of progress was made on the irc bot though? 18:04:26 what's the list of tickets prblem sorry? 18:04:32 yes, progress on the bot: it's up and running! :) 18:04:44 the list is on the pad 18:04:53 if you do foo/bar#1234 in a zwiebelbot tor channel, it will poke at gitlab.tpo/foo/bar/issues/1234 18:04:57 heh 18:05:10 i guess i just found a bug :p 18:05:29 hm, that is neat 18:05:40 can it be done to not need foo/bar ? 18:05:45 or is it just that format one should use? 18:05:46 #info zwiebelbot has been patched to support gitlab, but some issues remain: private or non-existent issues are not handled 18:06:03 ahf: #foo will work for the legacy tickets, but it can't possibly work for all projects 18:06:28 so we can make #xyz work to be like legacy/legacy/xyz ? 18:06:37 yes, eventually 18:06:46 for now that still points at trac, but we'll make the switch after the migration 18:07:06 sweet 18:07:08 that is really nice 18:07:43 ggus: was mentioned by pili to talk about cypherpunks 18:07:47 yes 18:07:48 i'm still confused as to what "list of tickets" means 18:07:57 yes thank you ahf and ggus 18:07:57 anarcat: my todo list is on the pad 18:08:03 it's not on trac if that is what you are asking 18:08:03 * pili jumps back to her other meeting 18:08:05 ahf: yes, i see the pad 18:08:18 there is an "ISSUES NOT RESOLVED" 18:08:28 oh you're saying that the problem with the migration is the remaining not-resolved-issues? 18:08:30 got it 18:08:36 yeah, formatting issues sorry 18:08:37 anarcat: right, i found that bug earlier because prop#300 18:08:39 it is all formatting things 18:08:42 next week we will release community portal -community.tpo and we would like to collect feedback (ie bug report) on gitlab. do you htink it's possible to have an account like cypherpunks? 18:08:51 (ok there is no gitlab issue 300 yet) 18:09:00 ggus: i think there is people who thinks it is a bad idea 18:09:04 partially myself including 18:09:15 on gitlab we cannot restrict an account very much 18:09:28 so for example the first troll could just login and change the pw 18:09:40 arma2: not sure i follow, you're saying the gitlab redirector overlaps with prop patterns? 18:09:42 or delete the old comments from the other user 18:10:00 ggus: i understood that we were using RT for that now? 18:10:10 didn't we create queues for that already? or is that a different process? 18:10:21 in any case, i agree with ahf that gitlab is not the right place for now 18:10:29 anarcat: different things. we created for training@ 18:10:35 ggus: realistically i think it will be too early to experiment with that 18:10:45 gitlab recently announced that they are open sourcing more private components 18:10:52 so there will be a user support system 18:10:54 ahf: ok, do you think it migh make sense to point to trac? 18:10:56 where people can submit things anonymously 18:11:08 ggus: i think a form that you guys an email might be the best? 18:11:12 i don't know much about trac 18:11:15 except the api :-/ 18:12:01 ggus: trac could work, but i'd suggest we discuss this outside this meeting and just say "not gitlab" for now :) 18:12:16 we discussed three options: a) cypherpunk account on gitlab; b) cypherpunk account on trac.tpo ; c) a feedback survey on survey.tpo 18:12:35 survey.tpo sounds like what you need :) 18:12:41 but in any case it's not a) 18:12:53 anarcat: (i will tell you about the bug post-meeting. no need to clutter things now.) 18:12:53 antonela: pili: ^ 18:12:58 ahf: can we set an agenda for this meeting? i feel we're a bit all over the place :) 18:13:01 ahf: thanks 18:13:06 er 18:13:07 arma2: thanks 18:14:00 anarcat: i think we are at it. i think the next steps now would be to talk about next steps in migration, but i will need gaba there 18:14:07 and i don't have much of an update due to friday 18:14:10 so i think we can just call it 18:14:37 ggus: oki 18:15:04 should we call it and aim for same time next week? 18:15:16 ahf: so i think we need to talk about 1) status update (what we're doing now) 2) next steps in migration 3) more specifically, next steps in infra plan 18:15:31 so we'd be done with the stauts update and can talk about the next steps 18:15:50 (2) is "continue removing formatting issues" and "testing wiki migration" 18:15:57 (3) i am not sure if there is anything 18:15:58 ? 18:16:05 i had a question about the pad actually 18:16:07 as far as i know the service seems to be running stable and so on right now? 18:16:10 lin94 you say *not possible ASFAIK* 18:16:17 what does that refer to? 18:16:34 ah 18:16:37 the image stuff? 18:16:51 oh wait did you just update the status there? :) 18:16:52 it is a note saying that i am not sure we can convert the inline Image() tags from trac to something sensible with attachments in GL 18:16:59 right 18:17:01 no? 18:17:05 ok 18:17:13 so markdown supports images and we can definitely put images in gitlab comments 18:17:25 so we should be able to translate between those two, no? 18:18:37 hm, that one should be updated after friday actually, because now i do the attachments at the same time as i do the ticket 18:18:44 like i fetch attachments first, then post ticket to gitlab 18:18:44 awesome 18:18:54 so attachements and images should work... 18:18:58 i have test tickets for those if you like 18:19:10 yeah 18:19:13 there is in the pad 18:19:16 let me know if you want me to dig some out, not sure if i can find them quickly, but i definitely used that feature 18:19:29 #30936 18:19:33 is one of them 18:19:44 okay 18:19:58 seems like you got it covered 18:20:03 so i guess we can move on to point 3) 18:20:06 which i would like to talk about :) 18:20:14 the service is running stable now 18:20:16 but we don't have users 18:20:21 and i think it will not scale to many users 18:20:45 i have only hosted small (few users) gitlabs before, so i have not much clue on that 18:20:48 this is gitlab right now 18:20:49 Instance Hypervisor OS Primary_node Status Memory 18:20:54 gitlab-02.torproject.org kvm debootstrap+buster fsn-node-03.torproject.org running 12.0G 18:20:55 gah 18:21:23 Instance ConfigVCPUs ConfigMaxMem Disk_sizes 18:21:23 gitlab-02.torproject.org 2 12.0G 30720,2048,20480 18:21:45 it's memory, disk, swap? 18:21:57 it has as much memory as cupani, vineale and troodi (git-rw, gitweb and trac) together, but only two CPUs (vs 10 total for the other three) 18:22:21 Disk_sizes is basically / (root fs), swap, /srv 18:22:34 /srv contains the repos? 18:22:39 i am guessing yeah 18:22:51 this is the legacy infra: 18:22:52 Instance ConfigVCPUs ConfigMaxMem Disk_sizes 18:22:52 cupani.torproject.org 2 2.0G 20480,4096,40960 18:22:52 troodi.torproject.org 4 4.0G 20480,8192,51200 18:22:52 vineale.torproject.org 4 8.0G 20480,4096,40960 18:23:13 cupani is: 18:23:14 Filesystem Size Used Avail Use% Mounted on 18:23:14 /dev/mapper/vg_cupani-srv 39G 31G 5.9G 84% /srv 18:23:21 and your concern here is that it is now using a lot or it is now using a lot and in the future it might need even more? 18:23:25 so we already have a problem there that /srv is not big enough to have a copy of cupani 18:23:29 so we'll need to scale that up 18:23:42 my concern is that we haven't looked at this problem at all 18:23:50 is all of that git repos? 18:23:57 ah, the browser tree is of course yuge 18:24:00 yep 18:24:03 makes sense 18:24:22 i'd be surprised, actually, if gitlab stays at 30GB 18:24:31 i expect it to explode in disk usage over time 18:24:41 yeah because people will be much more just creating repos there 18:24:44 and throw stuff into it 18:24:49 today there is a bit of a process to get a repo 18:24:51 especially if we start CI, artifacts storage and containers registries 18:24:56 yep 18:25:03 but yeah, just git will eat a lot of I/O 18:25:19 both in terms of disk usage but also in terms of actual I/O in the cluster 18:25:23 that's one thing i'm worried about 18:25:26 i'm also worried about CPU usage 18:25:54 but the point is more that this is all an analysis that Someone(tm) needs to take care of 18:26:04 so far i've been kind of assuming that Someone Else(tm) would take care of it 18:26:21 but right now it certainly feels that if i don't do it, no one will and everyone kind of assumes we'll just wing it :p 18:26:32 hm 18:26:42 CI is not gonna be on that one for sure 18:26:52 artifact storage and container storage i don't even have on my mind yet 18:27:09 the CI is a bit of a security nightmare in my world so i want that off everything else 18:27:29 right 18:27:30 i think you are right to think the someone else thing 18:27:46 i think that is fine right now, you are already contributing a lot in these conversations 18:28:11 okay :) 18:28:20 well i just want to add that to our launch checklist i guess 18:28:22 we need to 18:28:24 so 18:28:31 you know we have the gift cards to digital ocean? 18:28:37 or not gift cards, but a sponsorship 18:28:44 that i haven't sorted out, but that dgoulet got us setup with 18:28:47 1. have someone talk with other gitlab providers (riseup, debian, gnome) and see what kind of traffic they're seeing and hardware they're using 18:29:00 i was hopeing to use that for CI, but since we don't have CI right now other than jenkins, CI is not on my priority list right now 18:29:02 but is on nice to have 18:29:11 2. plan extra hardware and related finances to make sure we can handle the load after launch 18:29:16 3. deploy that 18:29:34 2 and 3, by the way, might involve splitting the service into multiple servers (say run gitaly elsewhere) 18:29:42 and yes 18:29:47 is micah the riseup gitlab person? 18:29:58 i want to make it really crystal clear to everyone that we don't do CI just yet, that's another launch 18:30:16 yes! 18:30:17 agreed 18:30:22 as much as i'd like to retire jenkins, i don't think we should do it as we retire trac :) 18:30:29 one retirement at a time :) 18:30:40 yep 18:30:41 agreed 18:30:41 i am not sure about gitlab 18:30:43 we are in on that 18:30:45 but i can ask them 18:30:53 point is: who's bottomlining that side? 18:31:20 bottomlining? 18:31:53 uh 18:31:59 "making sure this happens"? 18:32:20 a "bottomliner" makes sure things are taken care of, even if they don't necessarily do the work themselves 18:32:27 coordinating i guess is a good approximation 18:32:37 are you up for talking with them about this? 18:32:43 i think you have a better overview of the concerns there 18:32:49 hiro knows some scaling issues from oniongit i guess 18:33:16 i guess i can bounce that problem around with hiro and see which lap it falls on at the end :p 18:33:33 i am also wondering if we have any sort of budget for this launch in terms of hardware provisionning 18:33:49 i would love to be able to throw hardware at this problem if we need to 18:34:00 as opposed to spending days of staff work trying to fine-tune gitlab's performance 18:34:07 so it is good you are a bit more balanced on this because i'm a bit of a yes hat person in all of this and it might be there are some performance issues that is worth thinking into it 18:34:11 that i am not seeing 18:34:20 i have no clue on budget here 18:34:25 that's because you're not dealing with trac crap right now :) 18:34:31 you would definitely be worried if you would 18:34:33 i think once we are migrated, we can get rid of some of the resources from the other services 18:34:40 yep! 18:34:41 for sure 18:36:23 crawling is an issue with trac, no? 18:36:26 yes 18:36:29 it will be worse with gitlab 18:36:31 oh 18:36:33 i would be 100% sure that gitlab probably have that issue too 18:36:34 and monitoring 18:36:50 if i google torproject stuff now i already see a lot of things on gitlab.torproject.org 18:36:50 we need monitoring, particularly hooking gitlab into prometheus and grafana 18:36:52 we have nothing right now 18:36:53 like individual commits and such 18:37:07 we should probably disable robots.txt because every call to many of the sites are probably expensive git operations 18:37:36 i'll open tickets 18:37:54 awesome 18:38:55 https://trac.torproject.org/projects/tor/ticket/33921 18:39:40 Can't you just stick it behind caching? Or are these idiosyncratic operations? 18:39:49 awesome 18:40:04 yanmaani: maybe putting varnish in front could be a solution, but i don't know 18:40:11 maybe just blocking crawling in the beginning is fine 18:40:24 we already have nginx in front of it, and yes we could setup caching there 18:40:26 Do Things That Don't Scaleā„¢ 18:40:40 but as i said, that falls in the "let's spend a week tuning gitlab" solution :p 18:40:49 (not sarcasm) 18:40:49 as opposed to "throw hardware at the problem" solution 18:41:00 https://trac.torproject.org/projects/tor/ticket/33922 18:41:08 alright, we can move the discussion to async 18:41:14 i'm happy with the meeting, and would move on :) 18:42:02 oki 18:42:05 let's close it down folks 18:42:10 meeting in 1 week at same time 18:42:21 thanks ahf 18:42:23 thanks folks, i will talk with gaba 18:42:25 #endmeeting