prior to this patch we were allowing 256MB for every function run, just
because that was the default for the docker driver and we were not using the
memory field on any given route configuration. this fixes that: docker
containers now get the correct memory limit passed in from the route. the
default is still 128MB.
there is also an env var now, `MEMORY_MB`, that is set on each function call;
see the linked issue below for rationale.
closes #186
ran the given function code from #186, and now i only see allocations up to
32MB before the function is killed. yay.
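for illustration, a minimal sketch of the idea, assuming the
fsouza/go-dockerclient driver and a route memory field in MB (the names and
route shape here are hypothetical, not the actual fn code):

```go
package drivers

import (
	"fmt"

	docker "github.com/fsouza/go-dockerclient"
)

const defaultMemoryMB = 128 // fall back when the route doesn't set one

// containerOpts maps a route's memory setting onto the docker container: a
// hard limit on the container itself, plus a MEMORY_MB env var so the
// function can see its own budget.
func containerOpts(image string, routeMemoryMB uint64) docker.CreateContainerOptions {
	if routeMemoryMB == 0 {
		routeMemoryMB = defaultMemoryMB
	}
	return docker.CreateContainerOptions{
		Config: &docker.Config{
			Image: image,
			Env:   []string{fmt.Sprintf("MEMORY_MB=%d", routeMemoryMB)},
		},
		HostConfig: &docker.HostConfig{
			Memory: int64(routeMemoryMB) * 1024 * 1024, // docker wants bytes
		},
	}
}
```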
notes:
there is no max for memory. for open source fn i'm not sure we want to
cap it, really. in the services repo we probably should add a cap before prod.
since we don't know any given fn server's ram, we can't verify that the
setting on any given route is something that can even be run.
remove envconfig & bytefmt
this updates the glide.yaml file to remove the unused deps, but a fresh
install is broken atm so i couldn't remove them from vendor/; going to fix
that separately (next update we just won't pull these in). also changed the
skip dir to the cli dir now that its name has changed (related to the
brokenness).
fix how ram slots were being allocated. integer division is significantly
slower than subtraction.
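for illustration, a minimal sketch of subtraction-based ram accounting under
a lock; the types and names are hypothetical, not the actual runner code:

```go
package runner

import "sync"

type ramPool struct {
	mu    sync.Mutex
	avail uint64 // MB left in the pool
}

// reserve tries to carve mb out of the pool: a single compare-and-subtract
// per request, instead of dividing total RAM into fixed-size slots up front.
func (p *ramPool) reserve(mb uint64) bool {
	p.mu.Lock()
	defer p.mu.Unlock()
	if mb > p.avail {
		return false
	}
	p.avail -= mb
	return true
}

// release adds the reservation back once the function run finishes.
func (p *ramPool) release(mb uint64) {
	p.mu.Lock()
	p.avail += mb
	p.mu.Unlock()
}
```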
Each time the MQ became unreachable, HTTP GET /tasks returned HTTP 500, and
the code was not handling this case; it only expected networking errors. It
then tried to unmarshal the empty response body, which caused another,
misleading error. This patch raises an error based on the HTTP response code,
explicitly checking whether the code is something unexpected (not HTTP 200 OK).
The response status code for /tasks was also changed from 202 Accepted to 200 OK,
per the swagger doc.
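A minimal sketch of the check, with a hypothetical task shape and task server
URL (not the actual client code):

```go
package runner

import (
	"encoding/json"
	"fmt"
	"net/http"
)

type task struct {
	ID string `json:"id"`
}

// fetchTask fails fast on any non-200 response instead of trying to
// unmarshal an empty (or error) body, which is what produced the second,
// misleading error before this patch.
func fetchTask(taskSrvURL string) (*task, error) {
	resp, err := http.Get(taskSrvURL + "/tasks")
	if err != nil {
		return nil, err // networking errors were already handled
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK { // 200, not 202, per the swagger doc
		return nil, fmt.Errorf("GET /tasks: unexpected status %s", resp.Status)
	}
	var t task
	if err := json.NewDecoder(resp.Body).Decode(&t); err != nil {
		return nil, err
	}
	return &t, nil
}
```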
this works by having the functions server kick back a FXLB-WAIT header on
every request with the wait time for that function to start. the lb then
keeps, on a per node+function basis, an ewma of the last 10 requests' wait
times (to reduce jitter). now that we don't have max concurrency it's
actually pretty challenging to get the wait time to tick up. i expect in the
near future we will be throttling functions on a given node in order to
induce this, but that is for another day as that code needs a lot of
reworking. i tested this by introducing some arbitrary throttling (not
checked in) and load spreads over nodes correctly (see images). we will also
need to play with the intervals we want to use: if you have a func with a
50ms run time then basically 10 of those will rev up another node (this was
before removing max_c, with max_c=1), but in any event this wires in the
basic plumbing.
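for illustration, a minimal sketch of the per node+function ewma, with the
decay tuned to roughly a 10-sample horizon (the real lb bookkeeping and key
types differ):

```go
package lb

import (
	"sync"
	"time"
)

const alpha = 2.0 / 11.0 // ~10-sample horizon, smooths out jitter

type waitStats struct {
	mu   sync.Mutex
	ewma map[string]float64 // keyed by node + "\x00" + function path
}

func newWaitStats() *waitStats {
	return &waitStats{ewma: make(map[string]float64)}
}

// observe folds one wait time (reported via the FXLB-WAIT header) into the
// running average for that node+function pair.
func (s *waitStats) observe(node, fn string, wait time.Duration) {
	key := node + "\x00" + fn
	s.mu.Lock()
	defer s.mu.Unlock()
	if prev, ok := s.ewma[key]; ok {
		s.ewma[key] = alpha*float64(wait) + (1-alpha)*prev
	} else {
		s.ewma[key] = float64(wait) // seed with the first sample
	}
}
```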
* make docs great again. renamed lb dir to fnlb
* added wait time to dashboard
* wires in a ready channel to await the first image pull for hot functions so
it counts in the wait time (should be otherwise useful)
future:
TODO rework lb code api to be pluggable + wire in data store
TODO toss out first data point containing pull to not jump onto another node
immediately (maybe this is actually a good thing?)
this patch gets rid of max concurrency for functions altogether, as discussed,
since it will be challenging to support across function nodes. as a result of
doing so, the previous version of functions would fall over when offered 1000
functions, so there was some work needed in order to push this through.
further work is necessary as docker basically falls over when trying to start
enough containers at the same time, and with this patch essentially every
function can scale infinitely. it seems like we could add some kind of
adaptive restrictions based on task run length and configured wait time so
that fast running functions will line up to run in a hot container instead of
them all creating new hot containers.
this patch takes a first cut at whacking out some of the insanity that was
the previous concurrency model, which was problematic in that it limited
concurrency significantly across all functions: every task went through the
same unbuffered channel, which could create blocking issues for all functions
if the channel wasn't picked off fast enough (it's not apparent that this was
impossible in the previous implementation). in any event, each request has a
goroutine already; there's no reason not to use it. it's not too hard to wrap
a map in a lock, and it's not clear what the benefits of the old approach
were (added insanity?). in effect this is marginally easier to understand and
less insane (marginally). after getting rid of max c this adds a blocking
mechanism for the first invocation of any function, so that all other hot
functions wait on the first one to finish, to avoid a herd issue (it was
making docker die...) -- this could be slightly improved, but works in a
pinch. reduced some memory usage by removing redundant maps of htfnsvr's and
task.Requests (by a factor of 2!). cleaned up some of the protocol stuff;
need to clean this up further. anyway, it's a first cut. i have another patch
that rewrites all of it, but it was getting into rabbit hole territory; would
be happy to oblige if anybody else has problems understanding this rat's nest
of channels. there is a good bit of work left to make this prod ready
(regardless of removing max c).
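for illustration, a minimal sketch of the first-invocation gate described
above; later callers block until the first hot container for a function is up
instead of all launching their own (names are hypothetical, not the actual
code):

```go
package runner

import "sync"

type launchGate struct {
	mu    sync.Mutex
	first map[string]chan struct{} // function path -> "first launch done" signal
}

func newLaunchGate() *launchGate {
	return &launchGate{first: make(map[string]chan struct{})}
}

// enter returns launch=true for the very first caller of a function, along
// with a done func to unblock everyone else; later callers block until the
// first invocation signals done.
func (g *launchGate) enter(fn string) (launch bool, done func()) {
	g.mu.Lock()
	ch, ok := g.first[fn]
	if !ok {
		ch = make(chan struct{})
		g.first[fn] = ch
		g.mu.Unlock()
		return true, func() { close(ch) }
	}
	g.mu.Unlock()
	<-ch // wait for the first hot container to come up
	return false, func() {}
}
```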
a warning that this will break the db schemas, didn't put the effort in to add
migration stuff since this isn't deployed anywhere in prod...
TODO need to clean out the htfnmgr bucket with LRU
TODO need to clean up runner interface
TODO need to unify the task running paths across protocols
TODO need to move the ram checking stuff into worker for noted reasons
TODO need better elasticity of hot f(x) containers
* functions: modify datastore to accommodate hot containers support
* functions: protocol between functions and hot containers
* functions: add hot containers clockwork
* fn: add hot containers support
* functions: add bounded concurrency
* functions: plug runners to sync and async interfaces
* functions: update documentation about the new env var
* functions: fix test flakiness
* functions: the runner is self-regulated, no need to set a number of runners
* functions: push the execution to the background on incoming requests
* functions: ensure async tasks are always on
* functions: add prioritization to tasks consumption
Ensure that Sync tasks are consumed before Async tasks (see the sketch after
this list). Also, this fixes termination race problems for free.
* functions: remove stale comments
* functions: improve mem availability calculation
* functions: parallel run for async tasks
* functions: check for memory availability before pulling async task
* functions: comment about rnr.hasAvailableMemory and sync.Cond
* functions: implement memory check for async runners using Cond vars
* functions: code grooming
- remove unnecessary goroutines
- fix stale docs
- reorganize import group
* Revert "functions: implement memory check for async runners using Cond vars"
This reverts commit 922e64032201a177c03ce6a46240925e3d35430d.
* Revert "functions: comment about rnr.hasAvailableMemory and sync.Cond"
This reverts commit 49ad7d52d341f12da9603b1a1df9d145871f0e0a.
* functions: set a minimum memory availability for sync
* functions: simplify the implementation by removing the priority queue
* functions: code grooming
- code deduplication
- review waitgroups Waits
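A minimal sketch of the sync-before-async consumption mentioned above, using
a biased select over two hypothetical task channels (not the actual runner
code):

```go
package runner

type task struct{ id string }

func run(t task) { /* execute the task */ }

// consume drains syncCh with priority over asyncCh: the first select is
// non-blocking and only falls through when no sync task is pending.
func consume(syncCh, asyncCh <-chan task) {
	for {
		select {
		case t := <-syncCh:
			run(t)
			continue
		default:
		}
		select {
		case t := <-syncCh:
			run(t)
		case t := <-asyncCh:
			run(t)
		}
	}
}
```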
Currently, async workers are started before the HTTP interface is available
to serve their requests. This patch fixes that by ensuring async workers are
started only after the HTTP interface is up.
Essentially, we are getting rid of an error message during bootstrap:
ERRO[0000] Could not fetch task error=Get http://127.0.0.1:8080/tasks: dial tcp 127.0.0.1:8080: getsockopt: connection refused
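A minimal sketch of the ordering fix, assuming the async runner polls the
server's own /tasks endpoint; binding the listener before starting workers
guarantees the socket accepts connections first (names are illustrative):

```go
package main

import (
	"net"
	"net/http"
)

func runAsyncWorkers(tasksrv string) { /* poll tasksrv for async tasks */ }

func main() {
	// bind the socket first; once Listen returns, connections are accepted
	ln, err := net.Listen("tcp", ":8080")
	if err != nil {
		panic(err)
	}
	srv := &http.Server{Handler: http.DefaultServeMux}
	go srv.Serve(ln)

	// workers start strictly after the listener is up, so their first
	// GET /tasks can no longer hit "connection refused"
	go runAsyncWorkers("http://127.0.0.1:8080")

	select {} // block forever (stand-in for real lifecycle management)
}
```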
By default, BoltDB will hang while waiting to acquire the lock on the
datafile, so users might find themselves waiting without knowing for what.
The added timeout aims to inform the user about what's happening.
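A minimal sketch using BoltDB's Options.Timeout, which makes Open return an
error instead of blocking indefinitely on the file lock (the 10s value is
illustrative):

```go
package main

import (
	"log"
	"time"

	bolt "github.com/boltdb/bolt"
)

func main() {
	// without Timeout, Open blocks forever if another process holds the lock
	db, err := bolt.Open("bolt.db", 0600, &bolt.Options{Timeout: 10 * time.Second})
	if err != nil {
		// a held file lock now surfaces here as a clear error
		log.Fatalf("could not open bolt.db (is another instance running?): %v", err)
	}
	defer db.Close()
}
```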
Also, this renames MQADR to TASKSRV and refactors the configuration to read
environment variables. RunAsyncRunner now fills in the gaps when parsing
TASKSRV.
Fixes #119