1677 Commits

Author SHA1 Message Date
Tolga Ceylan
c89f1e5f9c fn: safer hand over between monitoring and main processing (#1316)
In runHot(), it's safer to use a separate channel between
monitoring go-routine and processing go-routine to handle
cancellations triggered by the monitoring go-routine.
2018-11-15 16:57:16 -08:00
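
The hand over described in this commit can be sketched roughly as follows. This is a minimal illustration, not the actual runHot() code: the names monitor, process, run, cancelCh and errMonitor are all hypothetical, and the real agent tracks far more state. The idea shown is only that the monitoring goroutine reports a cancellation on its own dedicated channel, so the processing goroutine can tell it apart from an ordinary context cancellation.

```go
package main

import (
	"context"
	"errors"
	"fmt"
)

// errMonitor is a hypothetical error used to mark monitor-triggered cancellations.
var errMonitor = errors.New("monitor requested cancellation")

// monitor waits for a (hypothetical) unhealthy signal and reports it on its
// own channel. cancelCh is buffered, so the send never blocks.
func monitor(ctx context.Context, unhealthy <-chan struct{}, cancelCh chan<- error) {
	select {
	case <-unhealthy:
		cancelCh <- errMonitor
	case <-ctx.Done():
	}
}

// process consumes work until the work channel is closed, the context ends,
// or the monitor asks for cancellation over the dedicated channel.
func process(ctx context.Context, work <-chan string, cancelCh <-chan error) error {
	for {
		select {
		case err := <-cancelCh:
			return err
		case <-ctx.Done():
			return ctx.Err()
		case _, ok := <-work:
			if !ok {
				return nil
			}
		}
	}
}

// run wires both goroutines together with a channel used for nothing else.
func run(unhealthy <-chan struct{}, work <-chan string) error {
	ctx, cancel := context.WithCancel(context.Background())
	defer cancel() // also stops the monitor when processing returns first

	cancelCh := make(chan error, 1)
	go monitor(ctx, unhealthy, cancelCh)
	return process(ctx, work, cancelCh)
}

func main() {
	unhealthy := make(chan struct{})
	close(unhealthy) // simulate the monitor detecting a problem
	fmt.Println(run(unhealthy, make(chan string)))
}
```

A buffered channel of size one is used so the monitor can report and exit even if the processing side has already returned.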
CI
935162ec6a fnserver: 0.3.616 release [skip ci] 2018-11-15 21:46:00 +00:00
Tolga Ceylan
6eaf1578e6 fn: container initialization monitoring (#1288)
During lengthy operations, the container initialization phase holds
resource tracker resources (tokens). For agent stability and liveness,
this phase has to be evictable/cancelable and time bounded.

This change introduces a new system-wide environment setting
to bound the time spent in the container initialization phase. This phase
includes docker-pull, docker-create, docker-attach, docker-start
and UDS wait operations. The initialization period is also now
considered evictable.
2018-11-15 13:37:43 -08:00
Tolga Ceylan
fe2b9fb53d fn: cookie and driver api changes (#1312)
The now-obsolete driver.PrepareCookie() call handled image and
container creation. Going forward, the agent needs finer-grained
control over the timeouts implied by the contexts.
For this reason, this change splits PrepareCookie()
into Validate/Pull/Create calls under the Cookie interface.
2018-11-14 16:51:05 -08:00
CI
65f3f915be fnserver: 0.3.615 release [skip ci] 2018-11-14 19:46:49 +00:00
Tolga Ceylan
8ee4c1098b fn: correct typo in docker command tag (#1311) 2018-11-14 11:38:48 -08:00
CI
325f28ef89 fnserver: 0.3.614 release [skip ci] 2018-11-14 17:31:04 +00:00
Eric Fode
90e39c8fd3 initial addition of the diskfree op (#1308)
* initial addition of the diskfree op

fixing up some typos

last of fmt errors

* fixed up some feedbacks
2018-11-14 09:22:07 -08:00
CI
3486af2981 fnserver: 0.3.613 release [skip ci] 2018-11-09 18:33:23 +00:00
Andrea Rosa
182db94fad Feature/acksync response writer (#1267)
This implements a "detached" mechanism to get an ack from the runner
once it actually starts to run a function. In this scenario the response
is just a 202 if we placed the function within a specific
time frame. If we hit errors or fail to place the fn in time, we
return different errors.
2018-11-09 10:25:43 -08:00
CI
2df6c8d349 fnserver: 0.3.612 release [skip ci] 2018-11-07 20:43:19 +00:00
Tolga Ceylan
25afb2f478 fn: remove tini option & env variable (#1301) 2018-11-07 12:35:19 -08:00
CI
46c25215f4 fnserver: 0.3.611 release [skip ci] 2018-11-07 19:18:15 +00:00
CI
9e77f2b9a0 fnserver: 0.3.610 release [skip ci] 2018-11-06 00:09:27 +00:00
Tolga Ceylan
975b780695 fn: tests for hung and bad docker repo during docker-pull (#1298)
* fn: tests for hung and bad docker repo during docker-pull
2018-11-05 16:01:42 -08:00
CI
9b50eaddf1 fnserver: 0.3.609 release [skip ci] 2018-11-05 17:49:33 +00:00
Harry Smith
3f0d4804b2 Fix filtering by name when getting list of funcs (#1295)
* Fix filtering by name when getting list of funcs

* Add datastore test for function list name filter

* Fix tests
2018-11-05 09:41:59 -08:00
CI
3a383049f9 fnserver: 0.3.608 release [skip ci] 2018-11-02 21:10:39 +00:00
Tolga Ceylan
5415b2bc38 fn: move UDS client into container to keep runHot() simpler (#1297) 2018-11-02 14:03:09 -07:00
CI
92c7723997 fnserver: 0.3.607 release [skip ci] 2018-11-02 20:39:53 +00:00
Tolga Ceylan
ac17825a36 fn: add container state to eviction stats (#1296) 2018-11-02 13:32:13 -07:00
CI
494fb1827b fnserver: 0.3.606 release [skip ci] 2018-11-01 21:30:28 +00:00
Tolga Ceylan
de9c2cbb63 fn: cleanup of docker timeouts and docker health check (#1292)
Moving the timeout management of various docker operations
to the agent. This allows for finer control over the timeout each
operation should use. For instance, for pause/unpause our tolerance
is very low to avoid resource issues. For docker remove,
a failure can lead to potential agent
failure and therefore we wait up to 10 minutes.
For cookie create/prepare (which includes docker-pull)
we cap this at 10 minutes by default.

With the new UDS/FDK contract, the health check is now obsolete
as containers advertise health using UDS availability.
2018-11-01 14:22:47 -07:00
CI
1e3104c649 fnserver: 0.3.605 release [skip ci] 2018-10-31 21:46:47 +00:00
CI
5c72e476f5 fnserver: 0.3.604 release [skip ci] 2018-10-30 19:19:05 +00:00
Tolga Ceylan
e227802512 fn: Remove error channel for container exits (#1287)
The channel is unnecessary and unreliable since exits
trigger I/O failure on UDS earlier than we detect
the exit.
2018-10-30 12:11:23 -07:00
CI
e01243fd40 fnserver: 0.3.603 release [skip ci] 2018-10-26 17:52:11 +00:00
Reed Allman
e13a6fd029 death to format (#1281)
* get rid of old format stuff, utils usage, fix up for fdk2.0 interface

* pure agent format removal, TODO remove format field, fix up all tests

* shitter's clogged

* fix agent tests

* start rolling through server tests

* tests compile, some failures

* remove json / content type detection on invoke/httptrigger, fix up tests

* remove hello, fixup system tests

the fucking status checker test just hangs and it's testing that it doesn't
work so the test passes but the test doesn't pass fuck life it's not worth it

* fix migration

* meh

* make dbhelper shut up about dbhelpers not being used

* move fail status at least into main thread, jfc

* fix status call to have FN_LISTENER

also turns off the stdout/stderr blocking between calls, because it's
impossible to debug without that (without syslog), now that stdout and stderr
go to the same place (either to host stderr or nowhere) and isn't used for
function output this shouldn't be a big fuss really

* remove stdin

* cleanup/remind: fixed bug where watcher would leak if container dies first

* silence system-test logs until fail, fix datastore tests

postgres does weird things with constraints when renaming tables, took the
easy way out

system-tests were loud as fuck and made you download a circleci text file of
the logs, made them only yell when they goof

* fix fdk-go dep for test image. fun

* fix swagger and remove test about format

* update all the gopkg files

* add back FN_FORMAT for fdks that assert things. pfft

* add useful error for functions that exit

this error is really confounding because containers can exit for all manner of
reason, we're just guessing that this is the most likely cause for now, and
this error message should very likely change or be removed from the client
path anyway (context.Canceled wasn't all that useful either, but anyway, I'd
been hunting for this... so found it). added a test to avoid being publicly
shamed for 1 line commits (beware...).
2018-10-26 10:43:04 -07:00
CI
7fd61054b0 fnserver: 0.3.602 release [skip ci] 2018-10-25 19:25:33 +00:00
Tolga Ceylan
241d3fede1 fn: blocking mode should not emit 503 if can't evict (#1283) 2018-10-25 12:17:26 -07:00
CI
5228269e15 fnserver: 0.3.601 release [skip ci] 2018-10-25 10:18:23 +00:00
Tolga Ceylan
bf41789af2 fn: eviction resource correction (#1282)
Previously the evictor did not perform an eviction
if the total cpu/mem of evictable containers was less
than the requested cpu/mem. With this change, we
try to perform evictions based on the actual needed cpu & mem
reported by the resource tracker.
2018-10-25 11:10:19 +01:00
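
The difference between the two checks can be illustrated with a small sketch. It rests on an assumption not spelled out in the commit message: that "actual needed" means the requested amount minus what is already free. The resources type and the functions needed, covers, oldCheck and newCheck are hypothetical and do not mirror the real evictor or resource tracker API.

```go
package main

import "fmt"

// resources is a hypothetical cpu/memory pair.
type resources struct {
	cpuMilli uint64
	memMB    uint64
}

// sub is a saturating subtraction: it never goes below zero.
func sub(a, b uint64) uint64 {
	if a > b {
		return a - b
	}
	return 0
}

// needed is the shortfall: what the request still lacks after free capacity
// is taken into account (an assumed reading of "actual needed").
func needed(requested, free resources) resources {
	return resources{
		cpuMilli: sub(requested.cpuMilli, free.cpuMilli),
		memMB:    sub(requested.memMB, free.memMB),
	}
}

// covers reports whether have is enough for want in both dimensions.
func covers(have, want resources) bool {
	return have.cpuMilli >= want.cpuMilli && have.memMB >= want.memMB
}

// oldCheck evicts only if evictable containers cover the whole request.
func oldCheck(requested, evictable resources) bool {
	return covers(evictable, requested)
}

// newCheck evicts if evictable containers cover just the shortfall.
func newCheck(requested, free, evictable resources) bool {
	return covers(evictable, needed(requested, free))
}

func main() {
	requested := resources{cpuMilli: 1000, memMB: 512}
	free := resources{cpuMilli: 600, memMB: 256}
	evictable := resources{cpuMilli: 500, memMB: 300}

	fmt.Println("old:", oldCheck(requested, evictable))
	fmt.Println("new:", newCheck(requested, free, evictable))
}
```

With these illustrative numbers the old check refuses to evict even though freeing the evictable containers would be enough to place the request.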
CI
93777daff1 fnserver: 0.3.600 release [skip ci] 2018-10-19 23:11:01 +00:00
CI
a53b6a6199 fnserver: 0.3.599 release [skip ci] 2018-10-18 22:18:35 +00:00
Tolga Ceylan
8fe1c9a07c fn: reduce logging for evicted containers (#1276)
Let's not log evicted containers which would be context
canceled.
2018-10-18 15:10:15 -07:00
CI
9e87e86a84 fnserver: 0.3.598 release [skip ci] 2018-10-17 15:26:17 +00:00
Tom Coupland
ceb2a1fc8a Configure logrus to include nano seconds in log messages (#1273)
Currently the default time format, time.RFC3339, is used, which doesn't include any
subsecond resolution information. This makes it hard to understand the
ordering of log messages when viewing in a log aggregator, like
Kibana.

This change sets the TimestampFormat of the logrus JSONFormatter to
time.RFC3339Nano.
2018-10-17 16:17:00 +01:00
CI
f84ae832b7 fnserver: 0.3.597 release [skip ci] 2018-10-17 00:12:23 +00:00
James Jeffrey
fdaa776c9f Update promclient and its usage (#1272) 2018-10-16 17:03:49 -07:00
CI
d2629caed6 fnserver: 0.3.596 release [skip ci] 2018-10-15 19:23:23 +00:00
Tolga Ceylan
44e366d195 fn: add details to runner finish logging (#1271)
Adding http-status/fn-http-status details in runner finish
logger.
2018-10-15 12:15:08 -07:00
CI
50643cfe23 fnserver: 0.3.595 release [skip ci] 2018-10-06 00:10:22 +00:00
Tolga Ceylan
f10fab21bc fn: fixup possible go-routine leak (#1265) 2018-10-05 17:02:18 -07:00
CI
b7d53332d3 fnserver: 0.3.594 release [skip ci] 2018-10-05 23:40:15 +00:00
Reed Allman
e6eec186d0 small tweaks to dispatch (#1264)
* the dispatch span actually encloses dispatch and gives an accurate span now
* turning a call into an http request can't fail unless it's our fault, if
tests don't catch this, we don't deserve money
* moved http req creation inside of dispatch goroutine

there's further work to do cleaning up dispatch... removing the old formats
will make this slightly more clear, waiting for that. this was bugging me
anyway after seeing something else and was easy to fix up.
2018-10-05 16:32:01 -07:00
CI
34e40256d9 fnserver: 0.3.593 release [skip ci] 2018-10-05 02:02:35 +00:00
Tolga Ceylan
29dcf0a791 fn: adding docker events to stats (#1262)
Streaming docker events is useful as we can record/capture some
asynchronous container events such as out-of-memory. For now,
we record these in opencensus/prometheus stats.
2018-10-04 18:54:09 -07:00
CI
ec2f9539f2 fnserver: 0.3.592 release [skip ci] 2018-10-04 23:06:37 +00:00
Tolga Ceylan
5a9118ff32 fn: default fnserver tag keys and api key adjustment (#1261)
Default fn server keys should be minimal (empty) since not
all stats have associated app name, fn id, etc.

API tags for requests should not include "status" as this is
part of responses.
2018-10-04 15:58:21 -07:00
CI
0595e9b1c8 fnserver: 0.3.591 release [skip ci] 2018-10-01 23:24:21 +00:00