
Stream Processing Algorithms

AGENDA
• Introduction
• Data Stream Management
• Real life applications
• Streaming Queries
• Issues
• Sampling
• Filtering
• Counting Distinct Elements
• Moments

Introduction

• In a DBMS, input is under the control of the programmer.
• Stream management is important when the input rate is controlled externally.
• Example: Google queries.
The Stream Model

• Input tuples enter at a rapid rate, at one or more input ports.
• The system cannot store the entire stream accessibly.
• How do you make critical calculations about the stream using a limited amount of (secondary) memory?
Data Stream Management system
 Stream processor – a kind of data management system.
 Any number of streams can enter the system.
 Each stream can provide elements on its own schedule; streams need not have the same data rates or data types.
 The time between elements of one stream need not be uniform.
 The fact that the rate of arrival of stream elements is not under the control of the system is what distinguishes stream processing from a DBMS.
[Figure: A data stream management system. Several streams (e.g., . . . 1, 5, 2, 7, 0, 9, 3 and . . . a, r, v, t, y, h, b and . . . 0, 0, 1, 0, 1, 1, 0) enter the stream processor over time; the processor answers standing queries and ad-hoc queries and produces output, backed by a limited working storage and an archival storage.]
Data Stream Management system – contd..
 A DBMS controls the rate at which data is read from disk, and therefore never has to worry about data getting lost as it attempts to execute queries.
 Stream storage has two parts: i) archival store, ii) working store.
 Archival store
 – larger
 – can be examined only under special circumstances, using time-consuming retrieval processes
Data Stream Management system – contd..
 Working store
 – holds summaries or parts of streams
 – can be used for answering queries
 – might be disk or main memory, depending on how fast we need to process queries
 – of sufficiently limited capacity that it cannot store all the data from all the streams
Real Life Applications
• Web traffic
• Internet
• Sensor data
• Image data

Applications – Web traffic

• Web sites receive streams of various types.

Mining query streams
• Google wants to know what queries are more frequent today than yesterday.

Mining click streams
• Yahoo wants to know which of its pages got an unusual number of hits in the past hour.
• Many interesting things can be learned – e.g., an increase in queries like “dengue fever symptoms” enables us to predict the number of sufferers.
Applications – Web traffic – contd…
• A sudden increase in the click rate for a link could indicate either of the following:

• some news connected to that page, or

• the link is broken and needs to be repaired.
Applications – Sensors
• Consider a temperature sensor bobbing about in the ocean.
• It sends a reading of the surface temperature to a base station each hour.
• The data produced by this sensor is a stream of real numbers.
• The data rate is very low.
Applications – Sensors – contd…
• Suppose a GPS unit is attached to the sensor, to report surface height instead of temperature.
• The surface height varies quite rapidly, so the sensor would send back a reading every tenth of a second.
• If it sends a 4-byte real number each time, then it produces 3.5 megabytes per day.
• To learn something about ocean behavior, it is necessary to deploy a million sensors, each sending back a stream at the rate of ten readings per second.
Applications – Sensors – contd…
• With a million sensors, there would be one for every 150 square miles of ocean.
• Then 3.5 terabytes of data arrive every day.
• We definitely need to think about
 – what can be kept in working storage, and
 – what can only be archived.
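The rates quoted on these slides can be checked with a little arithmetic (using decimal megabytes and terabytes):

```python
# One sensor: a 4-byte reading every tenth of a second.
bytes_per_reading = 4
readings_per_sec = 10
seconds_per_day = 24 * 60 * 60               # 86,400

per_sensor_per_day = bytes_per_reading * readings_per_sec * seconds_per_day
print(per_sensor_per_day / 1e6)              # ~3.46 MB/day, i.e. roughly 3.5 MB

# A fleet of a million such sensors:
sensors = 1_000_000
fleet_per_day = per_sensor_per_day * sensors
print(fleet_per_day / 1e12)                  # ~3.46 TB/day, roughly 3.5 TB
```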
Applications – Image data
• Satellites often send down to earth streams consisting of many terabytes of images per day.
• Surveillance cameras produce images with lower resolution than satellites, but there can be many of them, each producing a stream of images at intervals like one second.
• London is said to have six million such cameras, each producing a stream.
Stream Queries
There are two ways of querying streams:

i) Standing queries
ii) Ad-hoc queries
Standing Queries

• The processor contains a place where standing queries are stored.
• These queries are, in a sense, permanently executing, and produce outputs at appropriate times.
Standing Queries
Eg 1:
• Consider the stream produced by the ocean-surface-temperature sensor.
• A standing query outputs an alert whenever the temperature exceeds 25 degrees centigrade.
• This query is easily answered, since it depends only on the most recent stream element.
Standing Queries – contd…
Eg 2:
• Another query: the maximum temperature ever recorded by that sensor.
• We can answer this query by retaining a simple summary: the maximum of all stream elements ever seen.
• It is not necessary to record the entire stream.
• When a new stream element arrives, we compare it with the stored maximum and set the maximum to whichever is larger.
Standing Queries – contd…
Eg 3:
• Suppose we want the average temperature over all time.
• We have only to record two values: the number of readings ever sent in the stream and the sum of those readings.
• We can adjust these values easily each time a new reading arrives.
• We produce their quotient as the answer to the query.
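All three standing queries above need only constant-size state. A minimal sketch (the class and its field names are illustrative, not from the slides):

```python
class TemperatureQueries:
    """Constant-state standing queries over a stream of temperature readings."""

    def __init__(self, alert_threshold=25.0):
        self.alert_threshold = alert_threshold   # Eg 1: alert level
        self.maximum = float("-inf")             # Eg 2: max ever seen
        self.count = 0                           # Eg 3: number of readings...
        self.total = 0.0                         # ...and their running sum

    def process(self, reading):
        self.maximum = max(self.maximum, reading)
        self.count += 1
        self.total += reading
        # Eg 1 depends only on the most recent element:
        return reading > self.alert_threshold

    def average(self):
        return self.total / self.count           # quotient of the two values

q = TemperatureQueries()
alerts = [q.process(t) for t in [20.0, 26.5, 23.0, 24.5]]
print(alerts)       # [False, True, False, False]
print(q.maximum)    # 26.5
print(q.average())  # 23.5
```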
Ad-hoc Queries
• A question asked once about the current state of a stream or streams.
• If we do not store all streams in their entirety, then we cannot answer arbitrary queries about streams.
• If we have some idea what kinds of queries will be asked, we can prepare by storing appropriate parts or summaries of streams.
• To satisfy a wide variety of ad-hoc queries, a common approach is to store a sliding window of each stream in the working store.
Sliding Windows
• A useful model of stream processing is that queries are about a window of length N – the N most recent elements received.
• Interesting case: N is so large that it cannot be stored in memory, or even on disk.
• Or, the window can be all the elements that arrived within the last t time units, e.g., one day.
• Or, there are so many streams that windows for all of them cannot be stored.
[Figure: a window of the N most recent elements sliding over a stream of characters; elements to the left of the window are the past, elements to the right are the future.]
Eg:
Web sites often like to report the number of unique users over the past month.
• If we think of each login as a stream element, we can maintain a window that is all logins in the most recent month.
• We must associate the arrival time with each login, so we know when it no longer belongs to the window.
• If we think of the window as a relation Logins(name, time), then it is simple to get the number of unique users over the past month. The SQL query is:

SELECT COUNT(DISTINCT(name))
FROM Logins
WHERE time >= t;

Here, t is a constant that represents the time one month before the current time.
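A minimal in-memory sketch of such a window, keeping only each name's most recent login time (the names, times, and the one-month constant here are illustrative):

```python
MONTH = 30 * 24 * 3600  # window length in seconds (illustrative)

last_login = {}  # name -> most recent login time seen

def log_in(name, time):
    last_login[name] = time

def unique_users(now):
    """Count distinct users whose latest login falls within the last month."""
    expired = [n for n, t in last_login.items() if t < now - MONTH]
    for name in expired:            # evict logins that left the window
        del last_login[name]
    return len(last_login)

log_in("alice", 100)
log_in("bob", 200)
log_in("alice", MONTH + 50)         # alice logs in again much later
print(unique_users(MONTH + 300))    # 1: only alice's recent login remains
```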
Issues in stream processing
• Streams often deliver elements very rapidly.
• We must process elements in real time, or we lose the opportunity to process them at all without accessing the archival storage.
• It is important that the stream-processing algorithm executes in main memory, without access to secondary storage or with only rare accesses to secondary storage.
Issues in stream processing – contd…
• Even when streams are “slow,” as in the sensor-data example, there may be many such streams.
• Even if each stream by itself can be processed using a small amount of main memory, the requirements of all the streams together can easily exceed the amount of available main memory.
Issues in stream processing – contd…
• Many problems about streaming data would be easy to solve if we had enough memory.
• New techniques are required in order to execute them at a realistic rate on a machine of realistic size.
• Two generalizations about stream algorithms:
 • It is often much more efficient to get an approximate answer to our problem than an exact solution.
 • A variety of techniques related to hashing are useful – they introduce useful randomness into the algorithm’s behavior, producing an approximate answer that is very close to the true result.
Sampling Data in a Stream
• Problem: extracting reliable samples from a stream.
• If we know what queries are to be asked, then there are a number of methods for sampling.
• We are looking for a technique that will allow ad-hoc queries on the sample.
• Eg: A search engine receives a stream of queries, and it would like to study the behavior of typical users. Assume that the stream consists of tuples (user, query, time).
Sampling Data in a Stream – Example
• Suppose that we want to answer queries such as “What fraction of the typical user’s queries were repeated over the past month?”
• Assume also that we wish to store only 1/10th of the stream elements.
• The obvious approach would be to generate a random number, say an integer from 0 to 9, in response to each search query.
• Store the tuple if and only if the random number is 0.
• If we do so, each user has, on average, 1/10th of their queries stored.
Obtaining a Representative sample
• The previous query cannot be answered correctly by taking a sample of each user’s search queries.
• Instead, we should strive to pick 1/10th of the users, and take all their searches for the sample.
• If we can store a list of all users, and whether or not they are in the sample, then we could do the following:
• Each time a search query arrives in the stream, we look up the user to see whether or not they are in the sample.
Obtaining a Representative sample – contd…
• That method works as long as we can keep the list of all users and their in/out decision in main memory (because there isn’t time to go to disk for every search that arrives).
• By using a hash function, one can avoid keeping the list of users.
• I.e., we hash each user name to one of ten buckets, 0 through 9.
• If the user hashes to bucket 0, then accept this search query for the sample, and if not, then not.
Obtaining a Representative sample – contd…
• More generally, we can obtain a sample consisting of any rational fraction a/b of the users by hashing user names to b buckets, 0 through b − 1.
• Add the search query to the sample if the hash value is less than a.
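A sketch of this hash-based sampling. Using SHA-256 as the hash function is an arbitrary implementation choice; any well-mixing hash of the user name works:

```python
import hashlib

def in_sample(user, a, b):
    """Keep this user's queries iff the user name hashes to one of
    buckets 0..a-1 out of b, i.e. sample a fraction a/b of the users."""
    digest = hashlib.sha256(user.encode()).digest()
    bucket = int.from_bytes(digest[:8], "big") % b
    return bucket < a

# The decision depends only on the user name, so either ALL of a
# user's queries enter the sample or none do:
stream = [("alice", "q1"), ("bob", "q2"), ("alice", "q3")]
sample = [(u, q) for (u, q) in stream if in_sample(u, 1, 10)]  # ~1/10 of users
print(in_sample("alice", 10, 10), in_sample("alice", 0, 10))   # True False
```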
Varying Sample Size

• The sample will grow as more of the stream enters.
• In our running example, we retain all the search queries of the selected 1/10th of the users, forever.
• As time goes on, more searches for the same users will be accumulated, and new users that are selected for the sample will appear in the stream.
Exercise Problem

• Suppose we have a stream of tuples with the schema

Grades(university, courseID, studentID, grade)

Assume universities are unique, but a courseID is unique only within a university (i.e., different universities may have different courses with the same ID, e.g., “CS101”), and likewise studentIDs are unique only within a university (different universities may assign the same ID to different students).

Suppose we want to answer certain queries approximately from a 1/20th sample of the data.
Exercise Problem – contd…

For each of the queries below, indicate how you would construct the sample. That is, tell what the key attributes should be.

• For each university, estimate the average number of students in a course.
• Estimate the fraction of students who have a GPA of 3.5 or more.
• Estimate the fraction of courses where at least half the students got “A.”
Filtering streams
• A common process on streams is selection, or filtering.
• Accept those tuples in the stream that meet a criterion.
• Accepted tuples are passed to another process as a stream, while other tuples are dropped.
• If the selection criterion is a property of the tuple that can be calculated (e.g., the first component is less than 10), then the selection is easy to do.
• The problem becomes harder when the criterion involves lookup for membership in a set.
Filtering streams – contd…

• It is especially hard when that set is too large to store in main memory.
• A technique known as “Bloom filtering” gives a way to eliminate most of the tuples that do not meet the criterion.
Bloom Filtering – Example

• Suppose we have a set S of one billion allowed email addresses – believed not to be spam.
• The stream consists of pairs: an email address and the email itself.
• Since a typical email address is 20 bytes or more, it is not reasonable to store S in main memory.
• We could use disk accesses to determine whether or not to let through any given stream element.
• Or we can devise a method that requires no more main memory than we have available, and yet filters most of the undesired stream elements.
Bloom Filtering – Example
• Suppose one gigabyte of main memory is available.
• In Bloom filtering, that main memory is used as a bit array.
• There is room for eight billion bits, since one byte equals eight bits.
• Devise a hash function h from email addresses to eight billion buckets.
• Hash each member of S to a bit, and set that bit to 1; all other bits of the array remain 0.
Bloom Filtering – Example
• Since there are one billion members of S, approximately 1/8th of the bits will be 1.
• The exact fraction of bits set to 1 will be slightly less than 1/8th, because it is possible that two members of S hash to the same bit.
• When a stream element arrives, we hash its email address.
• If the bit to which that email address hashes is 1, then we let the email through.
• But if the email address hashes to a 0, we can drop this stream element.
Bloom Filtering – Example
• Unfortunately, some spam email will get through.
• Approximately 1/8th of the stream elements whose email address is not in S will happen to hash to a bit whose value is 1, and will be let through.
• Since the majority of emails are spam (about 80% according to some reports), eliminating 7/8th of the spam is a significant benefit.
Bloom Filtering – Example
• If we want to eliminate all spam, we need only check for membership in S those good and bad emails that get through the filter.
• Those checks will require the use of secondary memory to access S itself.
• Alternatively, we could use a cascade of filters, each of which would eliminate 7/8th of the remaining spam.
Bloom Filter
A Bloom filter consists of:
1. An array of n bits, initially all 0’s.
2. A collection of hash functions h1, h2, . . . , hk. Each hash function maps “key” values to n buckets, corresponding to the n bits of the bit array.
3. A set S of m key values.

• The purpose of the Bloom filter is to allow through all stream elements whose keys are in S, while rejecting most of the stream elements whose keys are not in S.
[Figures: a Bloom filter bit array with n = 10 bits and k = 3 hash functions; each key in S is hashed by all three functions, and the corresponding bits are set to 1.]
Bloom Filter – contd…
• To initialize the bit array, begin with all bits 0.
• Take each key value in S and hash it using each of the k hash functions.
• Set to 1 each bit that is hi(K) for some hash function hi and some key value K in S.
• To test a key K that arrives in the stream, check that all of h1(K), h2(K), . . . , hk(K) are 1’s in the bit array.
• If all are 1’s, then let the stream element through. If one or more of these bits are 0, then K could not be in S, so reject the stream element.
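A small sketch of such a filter. Deriving the k hash functions from salted SHA-256 is one implementation choice, not part of the definition above; the sizes are toy values:

```python
import hashlib

class BloomFilter:
    def __init__(self, n_bits, k):
        self.n = n_bits
        self.k = k
        self.bits = bytearray((n_bits + 7) // 8)  # array of n bits, all 0's

    def _positions(self, key):
        # k bucket numbers for this key, one per (salted) hash function.
        for i in range(self.k):
            d = hashlib.sha256(f"{i}:{key}".encode()).digest()
            yield int.from_bytes(d[:8], "big") % self.n

    def add(self, key):
        """Hash a member of S with all k functions; set those bits to 1."""
        for p in self._positions(key):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, key):
        """All k bits 1 => let the element through; any 0 => K is not in S."""
        return all(self.bits[p // 8] >> (p % 8) & 1 for p in self._positions(key))

bf = BloomFilter(n_bits=10_000, k=3)
for addr in ["good@example.com", "ok@example.com"]:
    bf.add(addr)
print(bf.might_contain("good@example.com"))  # True: no false negatives
print(bf.might_contain("spam@example.com"))  # very likely False (small false-positive rate)
```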
Counting Distinct elements in a stream
• Consider the problem of counting distinct elements in a stream, e.g., counting the number of unique users logged in.
• We can use efficient search structures such as hashing.
• But if the number of distinct elements is too high, then we need more main memory or more machines.
Counting Distinct elements in a stream
• Consider a generalization of the problem of counting distinct elements in a stream.
• The problem, called computing “moments,” involves the distribution of frequencies of different elements in the stream.
Moments
• Suppose a stream consists of elements chosen from a universal set, and let mi be the number of occurrences of the ith element.
• The kth-order moment of the stream is the sum over all elements i of (mi)^k.

Moments – contd…
• The 0th moment is the number of distinct elements in the stream.
• The 1st moment is the length of the stream.
• The 2nd moment, also called the surprise number, measures how uneven the distribution of elements is.
Surprise number

Consider a stream with 100 elements, of which 11 are distinct.

i) 10 elements occur 9 times each, and one element occurs 10 times:
Surprise number (2nd moment) = 9² × 10 + 10² × 1 = 910

ii) 10 elements occur 1 time each, and one element occurs 90 times:
Surprise number (2nd moment) = 1² × 10 + 90² × 1 = 8110
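When the counts fit in memory, moments can be computed exactly; the following reproduces both surprise numbers above:

```python
from collections import Counter

def moment(stream, k):
    """kth moment: sum of (occurrence count)**k over distinct elements."""
    return sum(m ** k for m in Counter(stream).values())

# Case (i): 10 elements occurring 9 times each, one occurring 10 times.
case1 = [f"e{i}" for i in range(10) for _ in range(9)] + ["x"] * 10
print(len(case1), moment(case1, 2))  # 100 910

# Case (ii): 10 elements occurring once each, one occurring 90 times.
case2 = [f"e{i}" for i in range(10)] + ["x"] * 90
print(len(case2), moment(case2, 2))  # 100 8110
```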
Alon-Matias-Szegedy (AMS) algorithm
• Suppose we do not have enough space to count all the mi’s for all the elements of the stream.
• We can still estimate the second moment of the stream using a limited amount of space.
• The more space we use, the more accurate the estimate will be.
• We compute some number of variables X.
Alon-Matias-Szegedy (AMS) algorithm
Let us define
X = (element, value)
X.element: an element of the universal set
X.value: a count of X.element in the stream, starting at a randomly chosen position

Example:
n = 15
Stream: a b c b d a c d a b d c a a b
ma² + mb² + mc² + md² = 5² + 4² + 3² + 3² = 59
AMS algorithm – contd…

To determine the value of a variable X:
• Choose a position in the stream between 1 and n, at random.
• Set X.element to be the element found there and initialize X.value to 1.
• As we read the stream, add 1 to X.value each time we encounter another occurrence of X.element.
AMS algorithm – contd…

(1) Randomly pick 3 positions in the stream (3 variables from which to compute the 2nd-order moment):

a b c b d a c d a b d c a a b

Say positions 3, 8 and 13 are picked: X1 = (c, 1), X2 = (d, 1), X3 = (a, 1).

(2) Process the stream one element at a time, incrementing a variable’s value at each later occurrence of its element:

X1 = (c, 2), then X1 = (c, 3); X2 = (d, 2); X3 = (a, 2).
AMS algorithm – contd…
• Estimate of the second-order moment from any X = (element, value):
Estimate = n × (2 × X.value − 1)

• Estimate from X1: 15 × (2 × 3 − 1) = 75
• Estimate from X2: 15 × (2 × 2 − 1) = 45
• Estimate from X3: 15 × (2 × 2 − 1) = 45

• Average(X1, X2, X3) = 55
• True value for our stream: 59
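The worked example can be reproduced directly (positions are 1-based, matching the slides):

```python
def ams_estimate(stream, position):
    """AMS estimate of the 2nd moment from one variable whose position
    (1-based) was chosen in advance: n * (2 * X.value - 1)."""
    n = len(stream)
    element = stream[position - 1]                 # X.element
    value = sum(1 for x in stream[position - 1:] if x == element)  # X.value
    return n * (2 * value - 1)

stream = list("abcbdacdabdcaab")                   # n = 15
estimates = [ams_estimate(stream, p) for p in (3, 8, 13)]  # X1, X2, X3
print(estimates)                                   # [75, 45, 45]
print(sum(estimates) / len(estimates))             # 55.0, vs. true value 59
```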
Higher order moments
• The same technique estimates kth moments for k > 2: if v = X.value, use the estimate n × (v^k − (v − 1)^k).
• For k = 2 this reduces to the formula above, since v² − (v − 1)² = 2v − 1.
Exercise
Compute the surprise number (second moment) for the stream
3, 1, 4, 1, 3, 4, 2, 1, 2. What is the third moment of this stream?

Solution for Exercise

Stream: 3, 1, 4, 1, 3, 4, 2, 1, 2
Occurrences of 3 = 2 times
Occurrences of 1 = 3 times
Occurrences of 4 = 2 times
Occurrences of 2 = 2 times
Surprise number = 2² × 3 + 3² × 1 = 21
Third-order moment = 2³ + 3³ + 2³ + 2³ = 51
Dealing with infinite streams
• The estimate we used for second and higher moments assumes that n, the stream length, is a constant.
• In practice, n grows with time.
• That fact doesn’t cause problems, since we store only the values of variables and multiply some function of each value by n when it is time to estimate the moment.
• We can count the number of stream elements seen and store this value (which only requires log n bits).
Dealing with infinite streams – contd…
 Problem: we must be careful how we select the positions for the variables.
 If we do this selection once and for all, then as the stream gets longer:
 – we are biased in favour of early positions,
 – the estimate of the moment will be too large.
 If we wait too long to pick positions, then:
 – early in the stream we do not have many variables,
 – we will get an unreliable estimate.
Dealing with infinite streams – contd…
 Solution:
• Maintain as many variables as we can store at all times, and throw some out as the stream grows.
• The discarded variables are replaced by new ones, in such a way that at all times the probability of picking any one position for a variable is the same as that of picking any other position.
• Suppose we have space to store s variables; the first s positions are each picked as the position of one of the s variables.
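This is reservoir sampling applied to stream positions: keep the first s positions, then, when the nth element arrives, keep position n with probability s/n, evicting a uniformly chosen stored variable. A sketch of the position-selection step:

```python
import random

def sample_positions(stream_length, s, rng=random):
    """Choose s positions uniformly from 1..stream_length in one pass."""
    reservoir = []
    for n in range(1, stream_length + 1):
        if n <= s:
            reservoir.append(n)              # first s positions are all kept
        elif rng.random() < s / n:           # keep position n with prob. s/n...
            reservoir[rng.randrange(s)] = n  # ...evicting a random variable
    return reservoir

random.seed(42)
positions = sample_positions(10_000, 5)
print(len(positions))                            # 5
print(all(1 <= p <= 10_000 for p in positions))  # True
```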
Counting Bits – (1)

• Problem: given a stream of 0’s and 1’s, be prepared to answer queries of the form “how many 1’s are in the last k bits?” where k ≤ N.
• Obvious solution: store the most recent N bits.
• When a new bit comes in, discard the (N + 1)st bit.
Counting Bits – (2)

• You can’t get an exact answer without storing the entire window.
• Real problem: what if we cannot afford to store N bits?
• E.g., we are processing 1 billion streams and N = 1 billion.
• But we’re happy with an approximate answer.
Something That Doesn’t (Quite) Work

• Summarize exponentially increasing regions of the stream, looking backward.
• Drop small regions if they begin at the same point as a larger region.
Example

[Figure: the bit stream 001110001010010001011011011100101011001101, summarized by exponentially increasing regions looking backward, each labeled with its count of 1’s (counts 1, 2, 2, 3, 4, 6, 10, 10, with the newest region marked “?”). We can construct the count of the last N bits, except we’re not sure how many of the last 6 are included.]
What’s Good?

• Stores only O(log² N) bits:
• O(log N) counts of log₂ N bits each.
• Easy to update as more bits enter.
• Error in count no greater than the number of 1’s in the “unknown” area.
What’s Not So Good?

• As long as the 1’s are fairly evenly distributed, the error due to the
unknown region is small – no more than 50%.
• But it could be that all the 1’s are in the unknown area at the end.
• In that case, the error is unbounded.

Fixup

• Instead of summarizing fixed-length blocks, summarize blocks with specific numbers of 1’s.
• Let the block sizes (number of 1’s) increase exponentially.
• When there are few 1’s in the window, block sizes stay small, so errors are small.
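The standard realization of this fixup is the DGIM algorithm. The sketch below is a simplified, unoptimized version: each bucket records the timestamp of its most recent 1 and a power-of-two count of 1’s, with at most two buckets of each size; when a third appears, the two oldest are merged.

```python
class DGIM:
    """Approximate count of 1's in the last `window` bits, O(log^2 N) space."""

    def __init__(self, window):
        self.window = window
        self.t = 0
        self.buckets = []  # [end_timestamp, size]; newest first, sizes non-decreasing

    def add(self, bit):
        self.t += 1
        # Drop the oldest bucket once its most recent 1 leaves the window.
        if self.buckets and self.buckets[-1][0] <= self.t - self.window:
            self.buckets.pop()
        if not bit:
            return
        self.buckets.insert(0, [self.t, 1])
        i = 0
        # Allow at most two buckets of each size: merge the two oldest of three.
        while i + 2 < len(self.buckets) and self.buckets[i + 2][1] == self.buckets[i][1]:
            self.buckets[i + 1][1] *= 2   # merged bucket: double size,
            del self.buckets[i + 2]       # keeping the newer end-timestamp
            i += 1

    def count_ones(self):
        """Estimate of 1's in the window: all buckets, minus half the oldest."""
        if not self.buckets:
            return 0
        total = sum(size for _, size in self.buckets)
        return total - self.buckets[-1][1] // 2

d = DGIM(window=100)
for _ in range(4):
    d.add(1)
print(d.count_ones())  # 3 (true count is 4; error bounded by half the oldest bucket)
```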
